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Abstract 

We present in this paper a first-order axiomatization of an extended theory T of finite 
or infinite trees, built on a signature containing an infinite set of function symbols and 
a relation fimte{t) which enables to distinguish between finite or infinite trees. We show 
that T has at least one model and prove its completeness by giving not only a decision 
procedure, but a full first-order constraint solver which gives clear and explicit solutions 
for any first-order constraint satisfaction problem in T. The solver is given in the form of 16 
rewriting rules which transform any first-order constraint into an equivalent disjunction 
(j) of simple formulas such that (j> is either the formula true or the formula false or a 
formula having at least one free variable, being equivalent neither to true nor to false and 
where the solutions of the free variables are expressed in a clear and explicit way. The 
correctness of our rules implies the completeness of T. We also describe an implementation 
of our algorithm in CHR (Constraint Handling Rules) and compare the performance with 
an implementation in C-I-+ and that of a recent decision procedure for decomposable 
theories. 

KEYWORDS: Logical first-order formula. Theory of finite or infinite trees. Complete 
theory. Rewriting rules. 



1 Introduction 



The algebra of finite or infinite trees plays a fundamental role in computer science: it 
is a model for data structures, program schemes and program executions. As early as 
1930, J. Herbrand (Herbrand 1930 1 gave an informal description of an algorithm for 



unifying finite terms, that is solving equations in finite trees. A. Robinson (Robinson 



e: 
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1965 1 rediscovered a similar algorithm when he introduced the resolution procedure 



for first-order logic in 1965. Some algorithms with better complexities have been 



proposed after by M.S. Paterson and M.N.Wegman (Paterson and Wegman 1978) 



and A. Martelli and U. Montanari (Martelli and Montanari 19821. A good syn- 



thesis on this field can be found in the paper of J. P. Jouannaud and C. Kirchner 



( Jouannaud and Kirchner 1991 1. Solving conjunctions of equations on infinite trees 



has been studied by G. Huet (Huet 19761, by A. Colmerauer (Colmerauer 1982) 



and by J. Jaffar ( Jaffar 19841. Solving conjunctions of equations and disequations 



on finite or infinite trees has been studied by H.J. Burckert (Burkert 1988) and 
A. Colmerauer (Colmerauer 19841. An incremental algorithm for solving conjunc- 



tions of equations and disequations on rational trees has then been proposed by 
V.Ramachandran and P. Van Hentenryck ( Ramachandran and Van Hentenryck 



1993 1 and a quasi-linear incremental algorithm for testing entailment and disentail- 



ment over rational trees has been given by A. Podelski and P. Van Roy (Podelski 



and Van Roy 19941 



On the other hand, K.L. Clark has proposed a complete axiomatization of the 
equality theory, also called Clark equational theory CET, and gave intuitions about 
a complete axiomatization of the theory of finite trees (Clark 19781. B. Cour- 



celle has studied the properties of infinite trees in the scope of recursive program 



schemes (Courcelle 1983 Courcelle 19861 and A. Colmerauer has described the 



execution of Prolog II, III and IV programs in terms of solving equations and dis- 



equations in the algebra of finite or infinite trees (Colmerauer 1984 Colmerauer 



1990 Benhamou et al. 1996 1 



Concerning quantified constraints, solving universally quantified disequations on 



finite trees has been studied by D.A. Smith (Smith 1991 1 and there exist some deci- 



sion procedures which transform any first-order formula into a Boolean combination 
of quantified conjunctions of atomic formulas using elimination of quantifiers. In 



the case of finite trees we can refer to A. Malcev (Malcev 1971 1, K. Kunen (Kunen 



19871 and H. Comon (Comon 1988 Comon 1991b 



Comon and Lescanne 1989 1. For 



infinite trees, we can refer to the work of H. Comon (Comon 1988 Comon 1991a 



and M. Maher ( [Maher 1988 1. 

M. Maher has axiomatized all the cases by complete first-order theories (Maher 



1988 1. In particular, he has introduced the theory T of finite or infinite trees built on 



an infinite set F of function symbols and showed its completeness using a decision 
procedure which transforms any first-order formula ip into a Boolean combination 
(j) of quantified conjunctions of atomic formulas. If ip does not contain free variables 
then 4> is either the formula true or false. 



K. Djelloul has then presented in (Djelloul 2006a I the class of decomposable 



theories and proved that the theory of finite or infinite trees is decomposable. He 
has also given a decision procedure in the form of five rewriting rules which, for 
any decomposable theory, transforms any first-order formula ip into an equivalent 
conjunction (f> of solved formulas easily transformable into a Boolean combination 
of existentially quantified conjunctions of atomic formulas. In particular, if ip has 
no free variables then </> is either the formula true or -^true. 

Unfortunately, all the preceding decision procedures are not able to solve complex 
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first-order constraint satisfaction problems in T. In fact, these algorithms are only 
basic decision procedures and not full first-order constraint solvers: they do not 
warrant that the solutions of the free variables of a solved formula are expressed 
in a clear and explicit way and can even produce, starting from a formula ip which 
contains free variables, an equivalent solved formula (j) having free variables but 
being always false or always true in T. The appropriate solved formula of ip in this 
case should be the formula false or the formula true instead of <j>. If we use for 
example the decision procedure of ( [Djelloul 2006a[ ) to solve the following formula ip 

^{^yx = f{y) A -n{3zwx^ f{z) A w = f{w))), 

then we get the following sol vecj^ formula 4> 

-^i3yx^fiy)A^i3zx = f{z))). 

The problem is that this formula contains free variables but is always true in the 
theory of finite or infinite trees. In fact, it is equivalent to 

^{3yx = f{y) A ^{3zx = f{y) A x = /(z))), 

i.e. to 

^{3yx = f{y) A ^{x ^ f{y) A (3zz = y))), 

thus to 

^{3yx^ f{y) A -(x = f{y))), 
which is finally equivalent to true. As a consequence, the solved formula of if should 
be true instead of (j). This is a good example which shows the limits of the decision 
procedures in solving first-order constraints having at least one free variable. 

Much more elaborated algorithms are then needed, specially when we want to 
induce solved formulas expressing solutions of complex first-order constraint satis- 
faction problems in the theory of finite or infinite. Of course, our goal in these kinds 
of problems is not only to know if there exist solutions or not, but to express these 
solutions in the form of a solved first-order formula which is either the formula 
true (i.e. the problem is always satisfiable) or the formula false (i.e. the problem 
is always unsatisfiable) or a simple formula which is neither equivalent to true nor 
to false and where the solutions of the free variables are expressed in a clear and 
explicit way. Algorithms which are able to produce such a formula (j) are called 
first-order constraint solvers. 



We have then presented in (Djelloul and Dao 2006b I, not only a decision proce- 



dure, but a full first-order constraint solver in the theory T of finite or infinite trees, 
in the form of 11 rewriting rules, which gives clear and explicit solutions for any 
first-order constraint satisfaction problem in T. The intuitions behind this algo- 



rithm come from the works of T. Dao in (Dao 2000) where many elegant properties 



of the theory of finite or infinite trees were given. As far as we know, this is the 
first algorithm which is able to do a such work in T. 

This is an extended and detailed version with full proofs of our previous work on 



is solved according to Definition 4.2.4 of I Djelloul 2006a 
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the theory T of finite or infinite trees (Djelloul and Dao 2006b'). Moreover, in this 
paper we extend the signature of T by the relation finite (t) which forces the term 
t to be a finite tree. Then we extend Maher's axiomatization by two new axioms 
and show its completeness by giving an extended version of our previous first-order 
constraint solver ( [Djelloul and Dao 2006b I. We also describe a CHR (Constraint 
Handling Rules) implementation of our rules and compare the performances with 
those obtained using a C++ implementation of our solver and the decision proce- 
dure for decomposable theories (Djelloul 2006a). 



Overview of the paper 

This paper is organized in five sections followed by a conclusion. This introduction 
is the first section. In section 2, we introduce the structure of finite or infinite trees 
and give formal definitions of trees, finites trees, infinite trees and rational trees. 
We end this section by presenting particular algebras which handle finite or infinite 
trees. 

In section 3, after a brief recall on first-order logic, we present the five axioms of 
our extended theorjj^T of finite or infinite trees built on a signature containing not 
only an infinite set of function symbols, but also a relation finite{t) which enables 
to distinguish between finite or infinite trees. We then extend the algebras given at 
the end of section 2 by the relation finite{t) and show that these extended algebras 
are models of T. In particular, we show that the models of sets of nodes, of finite 
or infinite trees and of rational trees are models of T. 

In section 4, we present structured formulas that we call working formulas and 
give some of their properties. These working formulas are extensions of those given 



in (Djelloul 2006a). We also introduce the notion of reachable variables and show 
that there exist particular formulas which have only quantified reachable variables, 
do not accept elimination of quantifiers and cannot be simplified any further. Such 
formulas are called general solved formulas. We then present 16 rewriting rules 
which handle working formulas and transform an initial working formula into an 
equivalent conjunction of final working formulas from which we can extract easily 
an equivalent conjunction of general solved formulas. We end this section by a full 
first-order constraint solver in T. This algorithm uses, among other things, our 
16 rules and transforms any first-order formula ip into a disjunction of simple 
formulas such that is either the formula true or the formula false or a formula 
having at least one free variable, being equivalent neither to true nor to false and 
where the solutions of the free variables are expressed in a clear and explicit way. 
The correctness of our algorithm implies the completeness of T. 

Finally, in section 5, we give a series of benchmarks. Our algorithm was imple- 



mented in C++ and CHR (Fruehwirth 1998 Fruehwirth and Abdennadher 2003 



Schrijvers and Fruehwirth 2006). The C++ implementation is able to solve for- 



mulas of a two player game involving 80 nested alternated quantifiers. Even if the 



^ We have chosen to denote by T the Maher's theory of finite or infinite trees and by T our 
extended theory of finite or infinite trees. 
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CH — h implementation is fastest, we found interesting to see how we can translate 
our algorithm into CHR rules. Using this high-level approach, we will be able to 
quickly prototype optimizations and variations of our algorithm and hope to par- 
allelize it. We also compare the performances with those of C-|— I- implementation 
of the decision procedure for decomposable theorie^ ( [Djelloul 2006a I . 

The axiomatization of T, the proof that T has at least one model, the 16 rewriting 
rules, the proof of the correctness of our rules, the first-order constraint solver in 
T, the completeness of T, the CHR implementation, the two player game and the 
benchmarks are new contributions in this paper. 



2 The structure of finite or infinite trees 
2.1 What is a tree? 

Trees are well known objects in the computer science world. Here axe some of them: 



\ 



aba b f ^ 



b 



a f 
s 

I 



I 

I I 



I 



their nodes are labeled by the symbols a,b,f,s of respective arities 0,0,2,1. While 
the first tree is a finite tree, i.e. it has a finite set of nodes, the two others are 
infinite trees, i.e. they have an infinite set of nodes. 

Let us now number from 1 to n and from left to right the branches that connect 
each node I to his n sons. We get: 





... 11 ... 

1 

1 

Each node c labeled by I can now be seen as a pair (p, V) where p is the position of 
the node, i.e. the smallest series of positive integers that we meet if we move from 

^ In i jDjelloul 2006a[ |, we have shown that the Maher's theory 1 of finite or infinite trees is 
decomposable. We can show easily using a similar proof that our extended theory T is also 
decomposable. 
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the root of the tree to the node c. Thus, the preceding trees can be represented by 
the following sets of nodes: 

{(e, /),(!,/), (2, ,s), (11, a), (12, 6), (21, a)} 
{(e, /), (1, a), (2, /), (21, b), (22, /), (221, a), (222, /), (2221, 6), ...} 
(£, /), (1, a), (2, /), (21, s), (22, /), (211, a), (221, s), (222, /), 
(2211, s), (2221, s), (2222, /), (22111, a), (22211, s), (222111, s), (2221111, a), .. 

Let us now formalize all the preceding statements. Let L be a (possibly infinite) 
set. Its elements are called labels. To each label I G Lis linked a non-negative integer 
called arity of I. An n-ary label is a label of arity n. A position is a word built on 
strictly positive integers (the empty word is denoted by e). Let p be a position and 
I a label. The pair {p, I) is called node and its depth is the lengtlj^ of p. An n-ary 
node is a node whose label is of arity n. A root is a node of depth 0. The row of 
an n-ary node, with n ^ 0, is the last integer of its position. We say that c is the 
father of c' or c' is the son of c if c and c' are nodes whose positions are respectively 
of the form ii...ik and ii...ikik+i, where the ij's are strictly positive integers and k 
a (possibly nulQ positive integer. Let us denote by N the set of the nodes labeled 
by elements of L. 

Definition 2.1.1 

A node c of is called arborescent in a sub-set A^i of if Ai 7^ and either 
c ^ A^i, or c € A^i and the two following conditions hold: 

• A^i — {c} does not contain any node whose position is the same than those of 

• c is either a root or the son of an n-ary node of A'^i which has exactly n sons 
in A^i of respective rows 1, n. 

We can now define formally a tree: 

Definition 2.1.2 

A tree tr is a sub-set of A^ such that each element of A^ is arborescent in tr. A 
finite tree is a tree whose set of nodes is finite. An infinite tree is a tree whose set 
of nodes is infinite. 

Let us now define the notion of subtree: 

Definition 2.1.3 

Let tr be a tree. The subtree linked to a node {ii...ik, I) of tr is the set of the nodes 
of the form {ik+i...ik+7nl') with {ii...ik+n,l') G tr ancj^ n > 0. We call subtree of 
tr a subtree linked to one of the nodes of tr. A subtree of tr of depth fc is a subtree 
linked to a node of tr of depth k. 



From Definition 2.1.2| we deduce that each subtree of a tree tr is also a tree. 



* As usual, the length of the empty word e is 0. 

^ Of course, for k = 0, ii...ik is reduced to e. 

® Of course, for n = 0, i') is reduced to {e,l'). 
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Definition 2.1.4 

A rational tree is a tree whose set of subtrees is a finite set. 

Note that an infinite tree can be rational. In fact, even if its set of nodes is infinite 
but n subtrees linked to n different nodes can be similar. Let us see this in the 
following example: 

Example 2.1.5 



Let us consider the three trees presented in the beginning of Section 2.1 Let us 
name them from left to righ by: tri, tr2 and tr^. The set of the subtrees of tri is 
the following finite set: 



< {(£,s),(l,a)}, y 

{(e,/),(l,a),(2,6)}, 
I {(£,/), (l,/),(2,s), (11, a), (12,6), (21,a)} J 

i.e. 




The set of the subtrees of tr2 is the following finite set: 



{(£,6)}, 

{(e, /), (1, a), (2, /), (21, 6), (22, /), (221, a), ...}, 
{(e, /), (1,6), (2, /), (21, a), (22, /), (221, 6), ...} 



I.e. 



The set of the subtrees of tr^ is the following infinite set: 



{(£,s),(l,a)}, 

{(£,s),(l,s),(ll,a)}, 

{(£,s),(l,s),(ll,s),(lll,a)}, 

{(£,/), (l,a), (2, /),(21,s), (22,/),...}, 

{(£, /), (1, s), (2, /), (11, a), (21, s), (22, /), ...} 
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I.e. 



a 





- 



a 



I -I /\ 

I /\ I I 

II - I 

a S s 

I I 

£ a 



Note that the tree tri has a finite set of nodes and a finite set of subtrees. Thus, 
it is a finite rational tree. The tree tr2 has an infinite set of nodes but a finite set 
of subtrees. Thus, it is an infinite rational tree. The tree trs has an infinite set of 
nodes and an infinite set of subtrees. Thus, it is an infinite non-rational tree. 

Note also that a rational tree can always be represented by a finite directed graph. 
For that, it is enough to merge all the nodes whose linked subtrees are similar. A 
non-rational tree cannot be represented by a finite directed graph. In this case, only 
an infinite directed graph representation will be possible. For example, the trees tri, 
tr2 and tr^ can be represented as follows: 



Of course, two different directed graphs can represent the same tree. For example 
the trees tr2 and tr^ can also be represented as follows: 



a 




r 



r 






t 



s 
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2.2 Construction operations 



We would like to provide the set Tr of finite or infinite trees with a set of construc- 
tion operations ; one for each label I of L. These operations will be schematized as 
follows: 



with n the arity of the label I. In order to formally define these construction 
operations, we need first to define them in the set D of sets of node^of N. Let i 
be a strictly positive integer. If d = (ji...jk, I) is a node then we denote by i.d the 
node {iji...jk, I)- If a is a set of nodes (i.e. a E D), then we denote by i.a the set of 
nodes {i.d \ d S a}. 

Definition 2.2.1 

In the set D, the construction operation linked to the n-ary label I is the application 
: (fli, a„) 1-^ {(e, I)} U l.ai U ... U n.On with ai...a„ elements of D. 

Remark 2.2.2 

Let a be an element of D. Let us denote by Vk{a) the set of nodes of a of depth k. 
Many remarks must be stated concerning any elements a, and b of D: 

1. a^b^ /\^^ Vk{a) = Vkib). 

2. z.oG''(ai,...,a„)) = {(£,/)}. 

3. For all /c > 0, there exists a function ^Pk+i which is independent from 
all the Vk+i{ai), with i G {!,... ,7i}, such that Vk+i{l^ {ai^ ...^an)) = 
^k+i{vk{ai), Vk{an))- 

4. The elements of i/o(Z^(ai, a„)) are arborescent in Z^(ai, a„). 

5. For all fc > 0, the elements of i^fc+i(/^(ai, a„)) are arborescent in 
^^(ai, a„) if and only if, for each i G {!,..., n}, the elements of Vk{ai) 
are arborescent in a^. 

6. If for all A: > the elements of i/fc(/^(ai, a„)) are arborescent in 
l^{ai, a„) then each element of N is arborescent in l^{ai, a„). 

Let now F be an infinite set of function symbols. Let us denote by: 

• N the set of the nodes labeled by F, 

• D the set of sets of nodes of N, 

• Tr the set of the elements of D which are trees, 

• Ra the set of the elements of Tr which are rational, 

• Fi the set of the elements of Tr which are finite. 




In other words, each element of D is a set of nodes, i.e. a subset of N. 
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If / is an n-ary function symbol taken from F then the operation of construction 
associated to / is an application of the form D. Let tri, tr„ be elements of 



Tr. From the fourth and fifth point of Remark 2.2.2 we deduce that f^{tri, trn) 



is also a tree, i.e. an element of Tr. Thus, we can introduce the following application: 

Z^'' : {tri, ...,trn) ^^ /^(iri, ...,ir„) which is of type Tr" Tr. 

On the other hand, the set of the subtrees of the tree f'^{tri,...,trn) is obtained 
by the union of the sets of the subtrees of all the tri plus the tree f^{tri, ...,tr„). 
Thus, if all the ir^'s are rational trees then the tree f^{tri, ...,ir„) is rational. As 
a consequence, we can introduce the following application: 

jRa . ^^^^^ ...,tr„) f^{tri, ...,tr„) which is of type i?a" — > Ra. 

Finally, if all the ir^'s are finite trees, then the tree f^{tri, ...,trn) is finite. Thus, 
we can introduce the following application: 

f^^ : {tri, ■■■,trn) f^{tri, ...,tr„) which is of type Fi" Fi. 

The pairs < D,{f°)f^F >, < Tr,(/^'-)/eF >, < > and < 

Ra, {f^"^) f^p > are known as the algebras of sets of nodes, of finite or infinite 
trees, of finite trees and of rational trees. 



3 The extended theory T of finite or infinite trees 
3.1 Formal preliminaries 

3.1.1 Formulas 

We are given once and for all an infinite countable set V of variables and the set L 
of logical symbols: 

=, true, false, ^, A, V, ^, V, 3, (, ). 

We are also given once and for all a signature S, i.e. a set of symbols partitioned 
into two subsets: the set of Junction symbols and the set of relation symbols. To 
each element s of 5' is linked a non-negative integer called arity of s. An n-ary 
symbol is a symbol of arity n. A 0-ary function symbol is called constant. 

As usual, an expression is a word on i U 5 U y which is either a term, i.e. of one 
of the two forms: 

X, f{ti, . . .,tn), (1) 
or a formula, i.e. of one of the eleven forms: 

s — t, r{ti, . . . ,tn), true, false, 

-.LP, {(fi^tl^), {(fVlp), {(f^lp), (2) 

(Vxip), {3x(p). 

In ([l]), X is taken from V, f is an n-ary function symbol taken from S and the t^'s 
are shorter terms. s,t and the t^'s are terms, r is an n-ary relation symbol 

taken from S and Lp and ip are shorter formulas. The set of the expressions forms a 
first-order language with equality. 
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The formulas of the first fine of (|2| are known as atomic, and flat if they are of 
one of the following forms: 

true, false, xq ^ xi,xo ^ f{xi, ...,Xn), r{xi, ...,x„), 

where all the x^'s are (possibly non-distinct) variables taken from V, f is an n-ary 
function symbol taken from S and r is an n-ary relation symbol taken from S. An 
equation is a formula of the form s ^ t with s and t terms. 

An occurrence of a variable a; in a formula is bound if it occurs in a sub-formula 
of the form (yxip) or {3x(f). It is free in the contrary case. The free variables of 
a formula are those which have at least one free occurrence in this formula. A 
proposition or a sentence is a formula without free variables. If is a formula, then 
we denote by var(ip) the set of the free variables of tp. 

The syntax of the formulas being constraining, we allowed ourselves to use infix 
notations for the binary symbols and to add and remove brackets when there are 
no ambiguities. Moreover, we do not distinguish two formulas which can be made 
equal using the following transformations of sub-formulas: 

(fi A ip =^ ip, ip Atp ip A ip, {(p Alp) A (j) =4> (p A (i/j A (/)), 
ip A true ip, ipy false =^ ip. 

If / is the set {ii, i„}, we call conjunction of formulas and write Aig/ each 
formula of the form (^j^ A(pi^ A...Aipi^Atrue. In particular, for / = 0, the conjunction 
is reduced to true. 

3.1.2 Model 

A model is a tuple M = < M, {f^)feF, (-R*^)refl >, where: 

• M , the universe or domain of A^, is a nonempty set disjoint from S, its 
elements are called individuals of Ad ; 

• F and R are sets of n-ary functions and relations in the set M , subscripted 
by the elements of S and such that: 

— for every n-ary function symbol / taken from 5*, is an n-ary opera- 
tion in M , i.e. an application from M" in M . In particular, when / is 
a constant, /^^ belongs to M; 

— for every n-ary relation symbol r taken from S, r^'^ is an n-ary relation 
in M, i.e. a subset of Af". 

Let M — < M,F,R > be a model. An M-expression ip is an expression built 
on the signature S U M instead of S, by considering the elements of M as 0-ary 
function symbols. If for each free variable x of ip we replace each free occurrence of 
X by a same element m in M, we get an Al-expression (p' called instantiatior^or 
valuation of (p by individuals of ^A. 

* We also say that the variable x is instantiated by m in 93'. 
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If if is an Al-formula, we say that ip is true in M and we write 

X h V', (3) 

if for any instantiation ip' of </? by individuals of M the set M has the property 
expressed by ipt, when we interpret the function and relation symbols of tpi by the 
corresponding functions and relations of M and when we give to the logical symbols 
their usual meaning. 

Remark 3.1.3 

For every A^-formula Lp without free variables, one and only one of the following 
properties holds: M \^ ip, M \^ ^(^. 

Let us finish this sub-section by a convenient notation. Let be a word 

on V and let i = ii...i„ be a word on M or V of the same length as x. If (p{x) and 
(j) are two A^-formulas, then we denote by <p(j), respectively 4>x^i i the A^-formula 
obtained by replacing in <p(a;), respectively in (f), each free occurrence of Xj by ij. 



3.1.4 Theory 

A theory is a (possibly infinite) set of propositions called axioms. We say that the 
model Al is a model ofT, if for each clement ip of T, Ai |= If is a formula, we 
write 

if for each model A4 ofT, A4 \= (p. We say that the formulas (p and ip are equivalent 
in T if T \= p> tp . 

Definition 3.1.5 

A theory T is complete if for every proposition ip, one and only one of the following 
properties holds: T \^ <p, T \^ -^ip. 

Let (/) be a formula and be a word on V such that var((j)) — x. From 

the preceding definition we deduce that a decision procedure is sufficient in the 
case where we want just to show the completeness of a theory T, as it was done in 



(Djelloul 2006a I for decomposable theories. In fact, the completeness of T depends 
only on the truth values of the propositions in T. On the other hand, finding for 
each model of T the instantiations i of x such that Ai \= 4>x*-i can be obtained 
only using a first-order constraint solver in T. This kind of problem is generally 
known as first-order constraint satisfaction problem. 



3.1.6 Vectorial quantifiers 

Let be a model and T a theory. Let X — X\ • • • X fi and y = 2/1 . . . y„ be two words 
on V of the same length. Let 0, pj and p3{x) be A^-formulas. We write 

3x p) for 3xi...3xnP^, 

\/x(p for \fxi...\fxn 'P, 

31xip{x) for VxVy(p(5) A V3(y) ^ /\»6{i,...,„} = J/i, 

3lx(p for (3x (p) A (3?x p>). 
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The word x, which can be the empty word e, is called vector of variables. Note that 
the formulas Bleip and 3\eip are respectively equivalent to true and to (p in any 
model A4. 

Notation 3.1.7 

Let Q be a quantifier taken from {V, 3,3!,3?}. Let x be vector of variables taken 
from V . We write: 

QxipA(f) for Qx{LpA4>). 

Example 3.1.8 

Let / = {1, n} be a finite set. Let (p and with z G / be formulas. Let x and yi 
with i S / be vectors of variables. We write: 

3a; (/3 A ^01 for 3x [ip /\ ^4>i) , 

yxifi A (fii for Va; {tp A (j>i), 

3!xv3A Aje/(3yi0O for 3!x A (3z/i(/)i) A ... A (3y„0„) A frite), 

3?x(pA Aje/^(3t/»0j) for 3?S A (^(3yi(/)i)) A ... A (^(3y„^„)) A irwe). 

Notation 3.1.9 

If 5; is a vector of variables then we denote by X the set of the variables of x. 

Let / be a (possible empty) finite set. The two following properties hold for any 
theory T: 

Property 3.1.10 
If T h 3?i if then 

{Bx'pA /\ -0,) <-> ((3x(^) A /\ -(35 A 
is/ ie/ 



Property 3.1.11 
If T ^ 3!x(^ then 



T 1= (35 ^ A /\ -0,) ^ /\ -(35 A 00- 



Full proofs of these two properties can be found in detail in (Djelloul 2006a I. 



3.2 The axioms of T 

Let be a set of function symbols containing infinitely many non-constant function 
symbols and at least one constant. Let finite be an 1-ary relation symbol. The theory 
T of finite or infinite trees built on the signature S = F U {finite} has as axioms 
the infinite set of propositions of one of the five following forms: 





^{f{x) = g{y)) 


[1] 


VxVy 


fix) = f{y) A^x, = yi 


[2] 


\/x3\z 


/\^z, = t,[xz] 


[3] 


VSVw 


-^{u — t[u,x] A finite (u) ) 


[4] 


\/x\/u 


{u = f{x) A finite {u)) ^ (u = f{x) A /\^finite{xi)) 


[5] 



14 



Khalil Djelloul, Thi-Bich-Hanh Dao and Thorn Friihwirth 



where / and g are distinct function symbols taken from i^, a; is a vector of (possibly 
non-distinct) variables Xi, y is a, vector of (possibly non-distinct) variables yt, z is 
a vector of distinct variables Zi , ti [xz\ is a term which begins with an element of F 
followed by variables taken from x or z, and x\ is a term containing at least one 
occurrence of an element of F and the variable u and possibly other variables taken 
from X. For example, we have T \= \/xiX2iu^{u = fi{xi, f2{u,X2)) A finite (u)) 
and T 1= Vu^(it — /i(/2('i*7 /o)i /o) A finite (u)) where /i and /2 are 2-ary function 
symbols and /o a constant of F. 

The forms [1],..., [5] are also called schemas of axioms of the theory T. Proposi- 
tion [1] called conflict of symbols shows that two distinct operations produce two 
distinct individuals. Proposition [2] called explosion shows that the same operation 
on two distinct individuals produces two distinct individuals. Proposition [3] called 
unique solution shows that a certain form a conjunction of equations has a unique 
set of solutions in T. In particular, the formula 3z z = f{z) has a unique solution 
which is the infinite tree /(/(/(...))). Proposition [4] means that a finite tree cannot 
be a strict subtree of itself. We emphasize strongly that should contain at 

least one occurrence of an element of F and the variable u. In Axiom [5], if x is the 
empty vector and / is a constant then we get Vu u — f A finite{u) ^ u = f , which 
means that the property finite{f) is true for each constant / of F. 

This theory is an extension of the basic theory of finite or infinite trees given 
by M. Maher in (Maher 19881 and built on a signature containing an infinite set 
of function symbols. Maher's theory is composed of the three first axioms of T 
and its completeness was shown using a decision procedure which transforms each 
proposition into a Boolean combination of existentially quantified conjunctions of 
atomic formulas. Note also that both Maher's theory and the theory T do not 
accept full elimination of quantifiers, i.e. there exist some quantified formulas whose 
quantifiers cannot be eliminated. For example, the formula 3xy ~ fix) is neither 
true nor false in T. It accepts in each model of T a set of solutions and another set 
of non-solutions. As a consequence, we cannot simplify it any further. This non-full 
elimination of quantifiers makes the completeness of T not evident. 



3.3 The models of T 



Let us extend the algebras given at the end of section 2.2 by the relation finite. 
More precisely, if ui, U2, U3 and 7/4 are respectively elements of D, Tr, Fi and Ra 
then the operations finite^{ui), finite'^ ^ {U2) , finite^^ {u^) and finite^"' (u^) are true 
respectively in D, Tr, Fi and Ra, if and only if ui, U2, U3 and W4 have a finite set 
of nodes. 

Let us now denote by: 



V =< D, (/^) f(:F, finite >, the model of sets of nodes, 
Tr —< Tr, (f^^) f^F,finite^^ >, the model of finite or in 
TZa —< Ra, (/^") j , finite^"' >, the model of rational ti 
Ti =< Fi, (/^*) f^F,finite^^ >, the model of finite trees. 



We have: 
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Theorem 3.3.1 

The models T), Tr and TZa are models of the theory T. 

This theorem is one of the essential contributions given in this paper and shows that 
our theory T is in fact an axiomatization of the structures D, Tr and Ra together 
with an infinite set of construction operations and the 1-ary relation finite . It also 
shows that T has at least one model and thus T 1= -^{true <-> false). 

Proof, first part: Let us show first that the model T) of sets of nodes is a model 
of T. In other words, we must show that the following properties hold: 



[3^] 
[4-] 
[5^] 



(Vai, 
(Vai, 
(Vai, 
(Vai, 
(n = 



a„ G i3)(V6i, &„ G D) ^(/^(fli, a„) = 5^(61, b„)) 
an G Z?)(V6i, 6„ G D) (/^(ai, a„) = /^(6i, 6„) ^ ALi 
a,„ G D)(3!6i,...,6„ G D) {/\"^^bi = tP[bi, ...,b„,ai, ...,am]) 
a,n G D)(Vit G D)^{u = f° [u, ai, a„] A finite^ (u)) 
a„ G D)(Vm G -D)(w = /^(ai,...,a„) A finite" {u)) ^ 
/°(ai,...,a„)AALiMte"K)) 



where / and 5 are distinct function symbols taken from F, t^[bi, ...,bn,ai, a^] 
is a term which begins with an element of F followed by variables taken from 
{oi, Cm, 5i, 6„}, and i^[u, oi, a„] is a term containing at least one occur- 
rence of an element of F and the variable u and possibly other variables taken from 
{ai, ...,a„}. According to Definition 



2.2.1 



and the definition of the relation finite 



the properties [1^], [2^], [4^] and [5^] hold. On the other hand, property [3^] is 
much less obvious and deserves to be proved. 

Let ai,...,am and 6i,...,6„ be elements of D. According to the first point of 
Remark [2?2!2l the P-formula 



(4) 



is equivalent in D to 

00 n 

/\ f\yk{bi) ^ yk{t?[bi,...,bn,ai,...,am\)- (5) 

fc = Oi=l 

Let i £ {l,...,n}. Let us denote by fi respectively ...,&„, oi, a,„]i the 
function symbol respectively the set of the variables which occur in the term 
<f [61, 6„, fli, a^]. According to the second and third point of Remark 2.2.2 
we have: 

• For each i G {1, n} there exists one node Lp^ = (e, /;), such that 

vo{tf [hi, ...,bn,ai, ...,arn]) = Wo}- 

• For each i G {1, ...,n} and each k > there exists a function fl-^i, which is 
independent from all the Vk+i{x), with x G oi, Omji, such that 

i^fe+i(if [61, ...,6„,ai, ...,am]) = (pl+iUvkih) , J^fc(6„), z/fc(ai), ...,Vk{am)]i), 

where [vk(bi), Vk{bn)i ^k{o-i)i ■■■1 i^k{o-m)\i is a tuple of elements of the form 
Vk{x) for all X G [61, ...,6„,ai, ...,0^]^. 
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Thus, the I?-formula ([5| is equivalent in V to 

n CO n 

(/\ Mbi) = {<Po})A(/\ /\ i^k+iibz) = vl+i{[i^k{bi), ■■■,'^k{bn),i^k{ai), ■■■,'^k{am)]i)), 

i=l fe=Oi=l 

from which we deduce that: 

• (i) For all i G {1, n}, i'o(fcj) has a constant value, which is equal to (e, fi). 

• (ii) Each Vk+iih) depends in the worst case on 
Vkibi), i^fc(6„), i^fc(ai), ;^fe(a„i), i.e. on Ukih), i^k{bn) and ai, a,„. 

Thus, by recurrenc^ on k, we deduce that (iii) each i/k+i{bi) with A: > and 
i G {1, depends only on oi, ...,am- From (i) and (iii) we deduce that all the 

bi's depend only on ai, a„i and thus property [3^] holds. In other words, for each 
instantiation of oi, a,„ by elements of D we can deduce the values of Vkibi) for 
all i G {1, n} and fc > 0. 

We have shown that the model V satisfies the five axioms of T and thus it is a 
model of T. 

Proof, second part: Let us now show that the model Tr of finite or infinite trees is 
a model of T. For that, it is enough to show the validity of the following properties 

[l^n (Vai,...,a„ G rr)(V6i,...,6„ G Tr) ^(/^''(ai, a„) = 5^'-(6i, b„)) 
[2^n (Vai,...,a„ G rr)(V6i, 6„ G Tr) (/^''("i, «n) = /^^(fei, ^ ALi = ^0 
[S^n (Vai , . . . , a™ G Tr ) (3 ! 6i , . . . , fe„ G Tr) (/\^^ J 6i = [6i , . . . , &„ , ai , . . . , a™] ) 
[4^n (Vai,...,a„ G rr)(V7i G Tr)^{u = ai, a„] A /imie'^'"(M)) 

[5^n (Vai,...,a„ G rr)(Vu G Tr)(M = f'^'' (a^ ■-, a„) A finite'^'' (u)) 
{u = /^'■(ai, a„) A finite'^'' (ai)) 

where / and g are distinct function symbols taken from F, tf'^[bi, 6„, ai, am] 
is a term which begins with an element of F followed by variables taken from 
{ai, Om, ^1, ■•■7 and ^-^''[u, ai, a„] is a term containing at least one occur- 
rence of an element of F and the variable u and possibly other variables taken 
from {ai,...,a„}. Since Tr is a subset of D, then according to the definition of 
f^'',/^, finite'^'' and finite", the properties [1^], [2^], [4^] and [5^] imply [1^'^], 
[2'^''], [4^'"] and [5'^'']. On the other hand, to show property [3"^''], it is enough to 
show the following implication: 

n m n 

(Vai, ■■■,am, hi, ...hn G =tf [bi,...,6„,ai,...,a„])A(/\a, G Tr)) ^ {/\heTr)) 

i—l i—l i—l 

(6) 

Let a, b, ai,...,am, bi,...,bn be elements of D. Let us consider the following notation: 
Arb{a, b) ^ each element of a is arborescent in b. 



If fc = then according to (ii) each 1^1(6^) depends in the worst case on vo(6i), ...,vo{b„) and 
ai,...,am- According to (i) all the i'o(bi), i'o(''n) have constant values and thus each viibi) 
depends only on ai , am- Let us now assume that each Vf^ (bi) depends only on ai, ...,am and 
let us show that this hypothesis is true for According to (ii), each ffc-|-i(6i) depends 

in the worst case on i'fc(fei), i^k{i>n) and ai, am, which according to our hypothesis depend 
only on ai, am- Thus, the recurrence is true for all fc > 0. 
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According to Definition 2.1.2| the Tr-formula 

n m 

(/\ bi = tf[bi, ...,6„,ai, ...,a,„]) A (/\ G Tr), 
1=1 j=i 

is equivalent in Tr to 

n rn 

(A ^« = "1' "™]) ^ (A ^'^K^, 

i=l 1=1 

which is equivalent to 

n oo m 

(A = ^^^[^1' "1' -'a™]) ^ ( A A ^'^K'^fc(iV), a»)), 

1=1 

which for each j > is equivalent in Tr to 



(7) 



fc=0 i=l 



(A ^' = &n,ai, ...,a„]) A ( A A Arb{iyk{N),a,)) A (A ^r6(z.j(fo,), 6^))- 

1=1 fe=Oi=l i=l 

(8) 

The equivalence (|7] ^^[^ holds for j = according to the fourth point of Remark 



2.2.2 and if we assume that this equivalence holds for an integer j with j > then 
according to the fifth point of Remark 2.2.2 we deduce that it holds also for j + 1. 
Thus, since the equivalence ([7|<-^|~8l) holds for any j > then according to the sixth 
point of Remark 2.2.2 and Definition |2.1.2 we deduce that (|8| implies 

n 

f\Arb{N,b,), 



which, according to Definition 2.1.2 impHes 

71 

A e Tr. 

i=l 

Thus, the implication ^ holds and Tr is a model of T. 

Proof, third part: Finally, let us show that the model TZa is a model of T. For that, 
it is enough to show the validity of the following properties: 



[1^° 

[2^1 (Vai, 

[3«1 (Vai, 

[4«'^] (Voi, 

[5^1 (Vai, 



(Vai,..., a™ e i?a)(V6i,...,6„ G i?a) ^(/«"(ai, a^) = g«''(6i, 6„)) 

a„ G i?a)(V6i,...,6„ G Ra) (/«'^(ai, a„) = /«"(6i, b„) ALi ' 
a™ G i?a)(3!6i,...,6„ G Ra) (A"^i fe» = if "[bi, 6n, ai, a„]) 
a™ G Ra)(yu G Ra)-.{u = t'^°-[u,ai, ...,a„] A finite'''"- (u)) 
a„ G 7?a)(Vu G -Ra)(it = f^^iai, ...,an) A finite^" {u)) ^ 



br) 



{u = /«"(ai, a„) A ALiMte-"'"(a.)) 

where / and 5 are distinct function symbols taken from F , if "[&i, 6„, ai, am] 
is a term which begins with an element of F followed by variables taken from 
{oi, a,m bi, bn}, and t^°'[u, oi, a„] is a term containing at least one occur- 
rence of an element of F and the variable u and possibly other variables taken from 
{ai,...,a„}. Since Ra is a subset of Tr and according to the definitions of f^^ , 
f^", finite'^'' and finite^'' then the properties [l^^^ P^l, [4^1 and [5^''] imply 
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[1^°], [2^°], [4^"] and [5^'']. On the other hand, in property [3^''], (in the preced- 
ing proof), a subtree of depth k of any bi is either one of the trees or a 
subtree of one of the a^'s with i G and j G {l,...,m}. This is true for 
k = and if we assume that it is true for k then we deduce that it is true for k+1. 
Thus, if the aj's are rational then the bi's in [3"^''] are also rational and thus we get 

We have shown that the models V, Tr and TZa are models of T. What about the 
model Ti of finite trees? Since F contains at least one function symbol / which is 
not a constant then according to Axiom [3] of T we have 

T \= 3\xx = f{x, ...,x). 

It is obvious that this property cannot be true in Ti, i.e. there exists no x & Fi 
such that X — f^^{x, x). Thus, the model J^i of finite trees is not a model of T. 
Let us end this section by a property concerning the cardinality of any model of 

T: 

Property 3.3.2 

Let M —< M, {J^'^) f^p.finite^^ > be a model of T. The model M has an infinity 
of individuals i such that M. ^ finite^\i). 

Proof 

Since the set F contains at least one function symbol / which is a constant then 
according to Axiom [5], with a; = e, we have 

M^fimte^Hf^')- (9) 

On the other hand, according to the definition of the signature of T, the set F 
contains an infinity of distinct function symbols which are not constants. Let /i 
one of these symbols. According to (|9| and Axiom [5] we have 

thus the individual /*^(/*^, /*^) is finite in M. Since the set F contains an 
infinity of distinct function symbols fi, f2, fs, ■■■ which are not constants then we 
can create by following the same preceding steps an infinity of finite individuals 

•••,/*0,/3*'(/'^ which start by distinct func- 
tion symbols. According to Axiom [1], all these individuals are distinct. According 
to (9) and Axiom [5] all these individuals are finite in Al. □ 

Corollary 3.3.3 

Each model of T has an infinite domain, i.e. an infinite set of individuals. 

4 Solving first-order constraints in T 

4-1 Discipline of the formulas in T 

Let us assume that the infinite set V is ordered by a strict linear dense order 
relation without endpoints denoted by Starting from this section, we impose 
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the following discipline to every formula tp in T: the quantified variables of ip are 
renamed so that: 

• (i) The quantified variables of ip have distinct names and different from those 
of the free variables. 

• (ii) For all variables x, y and all sub-formula^^ Lpi of ip, if y has a free 
occurrence in ipi and x has a bound occurrence in ipi then x )^ y. 

Example ^.1.1 

Let X, y, z, v be variables of V such that x )^ y >- z )^ v. Let ip be the formula 



3x X ^ fy A 



■^(3z z 
z 



x)K 
v) 



(10) 



The quantified variables of ip have no distinct names. Since the order >- is dense 
and without endpoints, there exists a variable w'vnV such that x>-y>-z'^v>-w, 
and thus ip is equivalent in T to 



3x x = fy A 



■^(3z z — x)A 
■^(3w w ^ v) 



In the preceding formula, the variables z and w have bound occurrences while the 
variables y and v have free occurrences. Since x>-y>-z>-v>-w then z and w must 
be renamed. On the other hand, since the order >- is dense and without endpoints, 
there exist two variables u and d'vaV such that x>~u>-d>~y>-z>-v>-w. Thus, 
the preceding formula is equivalent in T to 



3x X ^ fy A 



■^(3uu : 



: X)A 

v) 



In the sub- formula (3u u — x) the variable x has a free occurrence while the variable 
u has a bound occurrence. Since x >- u then u must be renamed. On the other hand, 
since the order >- is dense and without endpoints, there exists a variable ninV such 
that n>-x>-u>-d)^y)^z)^v>-'w. Thus, the preceding formula is equivalent in 
T to 

T(3nn = a;)A 
\3dd=v) 



3x X = fy A 



(11) 



This formula satisfies our conditions. Of course, the equivalence between (111 and 



(10 1 holds because in each step we renamed only the quantified variables. It is 



obvious that we can always transform any formula ip into an equivalent formula (f), 
which respects the discipline of the formulas in T, only by renaming the quantified 
variables of ip. It is enough for that to rename the quantified variables by distinct 
names and different from those of the free variables and then check each sub-formula 
and rename the quantified variables if the condition (ii) does not hold. 

We emphasize strongly that all the formulas which will be used starting from 
now satisfy the discipline of the formulas in T. 



By considering that each formula is also a sub-formula of itself. 
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4.2 Basic formula 

In this sub-section we introduce particular conjunctions of atomic formulas that we 
call basic formulas and show some of their properties. All of them will be used to 
show the correctness of our rewriting rules given in section |4.6| 

Definition 4-2.1 

Let Vi, u„, Ui, Um be variables. A basic formula is a formula of the form 

n rn 

{f\v, = U)h{f\fimte{ui)) (12) 

i=l 1=1 



in which all the equations = ti are flat. Note that if n = m = then (12) 
is reduced to true. The basic formula (12) is called solved if all the variables 
...,z)„,Mi, ...,u„i are distinct and for each equation of the form x = y we have 
X >- y. If a is a basic formula then we denote by 

• Lhs{a) the set of the variables which occur in the left hand sides of the 
equations of a. 

• FINI{a) the set of the variables which occur in a sub-formula of a of the 
form finite (x) . 

Note that if a is a solved basic formula then for all variables a; of a we have 
X G Lhs{a) ^ x^ FIN I {a). 



Example 4.2.2 

The basic formula x — x A finitely) is not solved because x'lf x. The basic formula 
2^ — f{y) Az — f{y) Afinite{x) is also not solved because a; is a left hand side of an 
equation and occurs also in finite{x). The basic formulas true (empty conjunction) 
and X — f{y) A z — f{y) A finite{y) are solved. 

According to the axiom [3] of T we deduce the following property: 

Property 4-2.3 

Let a be a solved basic formula containing only equations. Let x be the vector of 
the variables of Lhs{a). We have: T \= 3\xa. 



Property 4-2-4 

Let a and P be two solved basic formulas containing only equations. If Lhs{a) = 
Lhs{P) (mdT\=a^ (3 then T\=a^ p. 



Proof 

Let a and /3 be two solved basic formulas containing only equations such that 
Lhs{a) = Lhs{(3) and T \= a ^ (3. Let us show that we have also T |= /3 — > a. Let 
X be the vector of the variables of Lhs{a) and let y be the vector of the variables 
which occur in a — > /3 and do not occur in x. Since a and /3 are two solved basic 
formulas such that Lhs{a) = Lhs{(3) then (i) x is also the vector of the left hand 
sides of the equations of /3. Moreover, the following equivalences are true in T: 



Theory and Practice of Logic Programming 21 

^ VxVy a ^ l3 
^ \fy\fx V /3 

^ yy{^{^{3x P A a))) 
^ yy{^{3x P A ^a)) 



^ VyVx^/3Va 
^ VyVi P ^ a 
^ P ^ a 

□ 

Property 4.2.5 

Let a be a basic formula containing only equations and P and 6 two conjunctions of 
constraints of the form finite{x) such that a A/3 and a AS are solved basic formulas. 
We have T \= {a A P) ^ (a A (5) if and only if P and 6 have exactly the same 
contraints. 

Proof 

If P and 5 have the same constraints then it is evident that we have T |= (a A /3) <-> 
{a A 6). Let us now show that if we have T \= {a A P) ^ {a A S) then P and 6 
have the same constraints. Suppose that we have (*) T |= (a A /3) <-> (a A 5) and 
let us show that if finite(u) occurs in P then it occurs also in 6 and vice versa. 
If finite{u) occurs in P then T \= {a A P) ^ finite{u), thus from (*) we have (i) 
T \= {a A 5) ^ finite{u). Since a A /? is solved then u is not the left hand side of 
an equation of a. Thus, (ii) the conjunction a A 6 does not contain sub-formulas 
of the form u — t[x] A /\^ finite (xi). Since a A (5 is solved then S does not contain 
formulas of the form finite{v) where v is the left hand side of an equation of a. 
Thus, (iii) the conjunction a A d does not contain also sub-formulas of the form 
V = t[x,u\ A finitely). From (i), (ii) and (iii), finite(u) should occur in S. By the 
same reasoning (we replace P by 6 and vice versa), we show that if finite{u) occurs 
in S then it occurs in p. □ 

Let us now introduce the notion of reachable variable: 

Definition 4. 2. 6 

Let a be a basic formula and x a vector of variables. The reachable variables and 
equations of a from the variable xq are those which occur in a sub-formula of a of 
the form: 

Xo = to{xi) AXi^ ti{x2) A ... A Xn-l = tn^l{Xn), 

where a^i+i occurs in the term ti{xi^i). The reachable variables and equations of 
3a; a are those which are reachable in a from the free variables of 3a; a. A sub- 
formula of a of the form finite{u) is called reachable in 3a; a if u ^ a; or u is a 
reachable variable of 3a; a. 



according to the properties 



4.2.3 



and 



3.1.11 



according to: (i) and Property 4.2.3 and using the other 



sense (right to left) of the equivalence of Property 



3.1.11 



22 



Khalil Djelloul, Thi-Bich-Hanh Dao and Thorn Friihwirth 



Example ^.2.1 

In the formula: Buuw z = /(m, v) t\v — g{v, u) A w ^ f{u, v) A finite{u) A finite{x), 
the equations z = /(u, w) and f = g{v, u), the variables z, u and w and the formulas 
finite{u) and finite{x) are reachable. On the other hand the equation w = J{u,v) 
and the variable w are not reachable. 



Remark 4-2.8 

Let a be a solved basic formula. Let a; be a vector of variables. We have: 

• If all the variables of x are reachable in 3x a then all the equations and 
relations of a are reachable in 3x a. 

• If u = t[y] is a reachable equation in 3a; a, then a contains a sub-formula of 
the form 

k 

/\v,^t,[v,+,] (13) 
i=i 

with A: > 1 and (i) Vi ^ X, (ii) for all j € {1, ...,fc} the variable Vj+i occurs 
in the term (iii) Vk is the variable v, (iv) Wfc+i is the variable y and 

tk[vk+i\ is the term t[y\. 



According to the first point of Remark 4.2.8 and Definition 4.2.6 we have the 
following property: 

Property 4-2.9 

Let a be a solved basic formula. If the formula 3x a has no free variables and if all 
the variables of x are reachable in 3x a then x is the empty vector e and a is the 
formula true. 

According to the axioms [1] and [2] of T we have the following property: 
Property 4-2.10 

Let a be a basic formula. If all the variables of x are reachable in 3x a then 

T ^ 37xa. 

Property 4-2-11 

Let ir be a vector of variables and a a solved basic formula. We have: 

T h (35 a) ^ {3x' a'), 

where: 

• x' is the vector of the variable of x which are reachable in 3xa, 

• a' is the conjunction of the equations and the formulas of the form finite{x) 



which are reachable in 3. 



x a. 
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Proof 

Let us decompose x into three vectors x',x" and a;'" such that: 

x' is the vector of the variables of x which are reachable in 3x a. 

x" is the vector of the variables of x which are non-reachable in 3a; a and do not 

occur in the left hand sides of the equations of a. 

x'" is the vector of the variables of x which are non-reachable in 3x a and occur in 
a left hand side of an equation of a. 

Let us now decompose a into three formulas a', a" and a'" such that: 

a' is the conjunction of the equations and the formulas of the form finite(x) which 
are reachable in 3xa. 

a" is the conjunction of the formulas of the form finite{x) which are non- reachable 
in 3a; a. 

a'" is the conjunction of the equations which are non-reachable in 3a; a. 

According to Definition |4.2.6[ all the variables of x" and x'" do not occur in a' 
(otherwise they will be reachable) and since a is solved then x'" is the vector of the 
left hand sides of the equations of a'" and its variables do not occur in a". Thus 
the formula 3a; a is equivalent in T to 

(3x'a'A(3x" a" A (3a;'" a'"))). 



According to Property 4.2.3 we have T |= 3!a;"'Q;"'. According to Corollary 3.3.3 
we have T |= 3a;" a". Thus, the preceding formula is equivalent in T to (3a;' a'). 

□ 

Example 4-2.12 

The formula 3xyzwv — f{x,x) Aw = g{y,z,x) A finite (x) A finite {y) is equivalent 
in T to 

3.T V = f{x, x) A finite{x) A {3y z finite{y) A {3w w — g{y, z, x))), 
which, since T |= 3!ti; w = g(y, z, x) and T |= 3yz finitely), is equivalent in T to 

3a; w = f{x, x) A finite(x). 



Property 4.2.11 confirms the fact that the theory T does not accept full elimi- 
nation of quantifiers and shows that we can eliminate only non-reachable quan- 
tified variables. On the other hand, reachable variables cannot be removed since 
their values depend on the instantiations of the free variables. In fact, the formula 
3xv = f{x,x) A finite{x) is neither true nor false in T since for each model A4 of 
T there exist instantiations of the free variable v which make it false in A4 and 
others which make it true in M, and thus the reachable quantified variable x can- 
not be eliminated and the formula 3xv — f(x,x) A finite (x) cannot be simplified 
anymore. On the other hand, the formula 3ww — g(y, z, x) is true in any model of 
T and for any instantiation of z. The quantified non-reachable variable w can then 
be eliminated and the formula is replaced by true. As we will see in section |4.6| 
reachability, has a crucial role while solving first-order constraints in T. It shows 



24 



Khalil Djelloul, Thi-Bich-Hanh Dao and Thorn Friihwirth 



which quantifications can be chminated and enables to simplify complex quantified 
basic formulas. 

According to the axioms [1] and [2] and since the set F is infinite we have the 
following property: 

Property 4.2.13 

Let / — {1, n} be a finite (possibly empty) set and x and x' two disjoint vectors 
of variables. Let j/i,...,y„ be vectors of variables and Q;i,...,a„ solved basic formulas 
such that for alH € / all the variables of are reachable in 3yi a^. If each conjunc- 
tion ai contains at least (1) one sub-formula of the form finite{x) with x € X, or 
(2) one equation which contains at least one occurrence of a variable x ^ X U X', 
then: 

T h 3xx'{ /\ finite{x)) A (/\ -(3y, a,)). (14) 
xex' iei 



Proof 

Let M M, (/^) feF, finite^ > be a model of T. To show the validity of ^ it 
is enough to show that: 

M h ^xx'i /\ finite^' (x)) A (/\ -(3y, a,)). (15) 

Since the basic formulas ai are solved, they do not contain equations of the form 
X — X. Suppose now that one of the ai contains one equation of the form x = v 
with x G X {J X' and v € Yi. Since ai is solved then x >- v but according to 
the discipline of the formulas in T we have v >- a p^ Since the order >- is strict 
then X = V cannot be a sub- formula of ai. Thus, according to the conditions of 



Property 4.2.13 each conjunction contains at least (1) one sub-formula of the 



form finite{x) with x £ X, or (2) one equation of the one of the following forms: 

• (*) a; = /(wi, ...u„) with x e X U X' , 

• {**) X = V with X and v two distinct variables such that x £ X U X' and v ^ Y, 

• (***) V — t[x] where x is a variable of X U X' which occurs in the term t[x]. 



According to the first point of Remark 4.2.8 and since for all i G {!,..., n} the 



variables of i/i are reachable in 3yiai, then the equation v = t[x] is reachable in 
3yi Ui and thus according to the second point of Remark |4.2.8| the conjunction 
tti contains a sub-formula of the form {/\j^iVj ~ tj[vj^i]) with vi ^ Yi, for all 
j € {1, k} the variable Wj+i occurs in the term and w^+i is the variable 

X. But, since the case Vi G X U X' is already treated in (*) and (**), then we can 
restrict ourself without loosing generality to the case where Vi ^ YiU X U X', i.e. 



vi is free in ( 15 1 



In fact, the variable x has a free occurrence in 3yi ai and the variable v has a bound occurrence 
in 3yi ai (because ti is a quantified reachable variable in 3yi ai) and thus according to the 
discipline of our formulas we have v y x. 
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Let 

3xx'{ /\ finite^' (x)) A (/\ ^{3y, a*)) (16) 
xex' iei 

be an any instantiation of 3xx' (/\,^^^, finite^\x)) A (Aie/ ^(^^i c^i)) by individuals 
of Ai ■ Let us show that there exists an instantiation for the variables of X and X' 
which satisfies the preceding formula. For that, let us chose an instantiation which 
respects the following conditions: 

(i) For each x G X' , the instantiation x* of x satisfies M \= finite^\x*) . 
ill) If a conjunction a* contains a sub-formula of the form finite^^ (x) with a; € X 
then the instantiation x* of x satisfies A4 \= x* = f^^{x*, ...,x*) with / an n-ary 
function symbol of strictly positive arity which does not occur in any at with i € I. 

(iii) If a conjunction a* contains a sub-formula of the form x = f^'^{vi, ...Vn) with 
X G XUX', then the instantiation of x starts with a different function symbol than 
/• 

(iv) If a conjunction a* contains a sub-formula of the form x = v with x and v two 
distinct variables such that x E X U X' and v ^ Y, then the instantiation of x is 
different from those of v. 

(v) If a conjunction a* contains a sub- formula of the form (Aj=i — ^jbj-i-i]) 
with vi ^ {X U X' U Y), for all j E {1, k} the variable vj^i occurs in the term 

and Vk+i E X U X' , then the instantiation of v^+i is different from v* , 
where v* is the instantiation of v^+i obtained from those of v\ irj^ (16 1 so that 

A such instantiation of the variables of X and X' is always possible since : (1) 
there exists an infinity of function symbols in F which are not constants (2) the 
set of the individuals i of such that M \= finite^'' (i) is infinite (see Property 



3.3.2). As a consequence, according to axioms [1] and [4], this instantiation implies 



a conflict inside each sub-instantiated-formula 3yi a*, with i E {1, ...,n} and thus 

Mh3xx'{/\^{3y,a*)). 

iei 

Since this instantiation satisfies the first condition (i) of the preceding list of con- 



ditions then (16 1 holds and thus (151 holds. □ 



We emphasize strongly that this property holds only if the formula (14) satisfies 
the discipline of the formulas in T. This property is vital for solving first-order 
constraint over finite or infinite trees. In fact, since the variables of each with 
i E {1, n} are reachable in 3yi ai then we cannot eliminate or remove the quan- 
tification 3yi form 3yi a;, and thus solving a constraint containing such formulas is 



not evident. Property |4.2.13 enables us to surmount this problem by reducing to 



true particular formulas containing sub-formulas which does not accept full elimi- 
nation of quantifiers. 



Recall that vi {X U X' U Y) and thus vi is a free variable in \15\ . As a consequence, it is 
already instantiated in Jl6|. 
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Example 4-.2.H 

Let x,y, z and v be variables such that y > x > z > w. Let us consider the following 
formula (f: 



(17) 



-^{3ex — w)/\ 
-^{3ex^ g{x)) 

This formula satisfies the discipline of the formulas in T. Let M =< 
M, (/*^) f , finite'^^ > be a model of T. Note that we cannot eliminate the quanti- 
fier 3y in the sub-formula 3y z — f{y)/\y — g{x). In fact, this sub- formula is neither 
true nor false in T because there exist instantiations of the free variable z in A4 
which satisfy this sub-formula in A4 and others which do not satisfy it. On the other 



hand, Property 4.2.13 states that formula (17 1 is true in T for all instantiations of 
z even if the sub-formula 3y z — f{y) Ay = g{x) is neither true nor false in T. Let 
us check this strange result. For that, let us show that for each instantiation of the 
free variables z and w by two individuals z* and w* of M, there exists an instanti- 
ation X* of X which makes false the three Al-formulas {3y z* = f^{y) A y = g{x*)), 
{3ex* = w*) and (3ex* = g{x*)). We have: 

• In the formula {3y z — f(y) Ay ~ gix)), the variable x is reachable. Thus, its 
value is determined by the value of z (because z = f{g{x))). Two cases arise: 

— If z* is of the form f{g{i)) with i E M then it is enough to instantiate 
X by an individual x* G M which is different fronj^i, in order to make 
false {3yz* = /*^(y) A y = 9^' {x*)) in M. 

— if z* is not of the form f{g{i)) with i G M then the Al-formula {3y z* = 
f^{y) Ay — g^\x)) is false in M. for all the instantiations of x. 

• In the A^-formula {3ex = w*), it is enough to instantiate x by an element 
X* oi M which is different from w;* in order to make false the Al-formula 

{3ex* ^w*). 

• In the A^-formula {3ex = g^{x)), it is enough to instantiate x by an individ- 
ual which starts by a distinct function symbol than g in order to make false 
{3ex = g'^{x)) in M. 

Since the set of the functions symbols which are not constants is infinite then there 
exists an infinity of instantiations of x which satisfy the three preceding conditions. 
Each of these instantiations x* makes false the three Al-formulas {3y z* = f^^ (y) A 



y = g^^{x*)), {3£x* = w*) and {3£x* = 5*^(x*)) and thus (17 1 holds. 

4-3 Normalized formula 

For example, we can take x* = f^^{i). 
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Definition 4-3.1 

A normalized formula <p of depth d > 1 is a formula of the form 

-(3xaA/\^,), (18) 

with / a finite (possibly empty) set, a a basic formula and the ip[s are normalized 
formulas of depth di with d = I + max{0, di, d„}. 

Example 4.3.2 

Let / and g be two 1-ary function symbols which belong to F . The formula 



3e finite{u) A 



y = f{x) Ax = g{y) A ^(3e y = g{x) A finite{x)))A 
■^{3ex = f{z) /\ finite{z)) 



is a normalized formula of depth equals to three. The formula ^(3e true) is a nor- 
malized formula of depth 1 . The smallest value of a depth of a normalized formula 
is 1. Normalized formulas of depth are not defined and do not exist. 

We will use now the abbreviation wnfv for "without new free variables" . A formula 
if is equivalent to a wnfv formula in T means that T \= ip tp and ip does not 
contain other free variables than those of ip. 

Property 4-3.3 

Every formula ip is equivalent in T to a wnfv normalized formula of depth d > 1. 
Proof 

It is easy to transform any formula into a normalized formula, it is enough for 
example to follow the foUowings steps: 

1 . Introduce a supplement of equations and existentially quantified variables to trans- 
form the conjunctions of atomic formulas into conjunctions of flat formulas. 

2. Replace each sub-formula of the form false by ^true then express all the quantifiers 
and logical connectors using only the logical symbols ^, A and 3. This can be done 
using the following transformation of sub- formulas: 

{(p^(j>) =^ -^{pA^(p), 

{ip^4>) =^ (-(^A-(/)) A-(0A-(/3)), 

(yxifi) =^ -^{3x^ip). 

3. If the formula if obtained does not start with the logical symbol ^, then replace it 
by -^(3e true A ^f). 

4. Rename the quantified variables so that the obtained formula satisfies the imposed 



discipline in T (see Section 4. 1 ) 



5. Lift the quantifier before the conjunction, i.e. ip A (3xip) or (3x'ip) A ip, becomes 
3x ip A Ip because the free variables of ip are distinct from those of x. 

^'^ These equivalences are true in the empty theory and thus in any theory T. 



28 



Khalil Djelloul, Thi-Bich-Hanh Dao and Thorn Friihwirth 



6. Group the quantified variables into a vectorial quantifier, i.e. 3x(3yip) or 3x3y tp 
becomes 3xyip. 

7. Insert empty vectors and formulas of the form true to get the normalized form using 
the following transformations of sub-formulas: 

-(A ^'^') =^ ^ A (19) 

i£l iel 

-^{a/\ /\^ip,)=>^{3eaA /\^ip^), (20) 

i£l iel 

-^{3x A ^V'i) =^ -^C^x true A A ~^'Pj)- (21) 
ie-' jeJ 
with a a conjunction of elementary equations, / a finite (possibly empty) set and 
J a finite non-empty set. 

8. Rename the quantified variables so that the obtained normalized formula satisfies 
the discipline of the formulas in T. 

If the starting formula does not contain the logical symbol ^ then this transfor- 
mation will be linear, i.e. there exists a constant k such that n2 < kni, where ni is 
the size of the starting formula and n2 the size of the normalized formula. We show 
easily by contradiction that the final formula obtained after application of these 
steps is normalized. □ 



Example 4-3-4 

Let X, V, w, u be variables such that x )^ v )^ w >- u. Let / be a 2-ary function 
symbol which belongs to F. Let us apply the preceding steps to transform the 
following formula into a normalized formula: 

(/(u, v) = f{w, u) A {3x u — x))\J (3u\/w u = f{v, w)). 

Note that the formula does not start with ^ and the variables u and w are free in 
f{u,v) = /(w,u) A (3a; u = x) and bound in BuVww = f{v,w). Note also that this 
formula does not respect the discipline of the formulas in T. 

Step 1: Let us first transform the equations into fiat equations. The preceding 
formula is equivalent in T to 

(3ui ui — f{u,v) /\ui — f{w,u) A {3xu = x)) V {3u\/wu ~ f{v,w)), (22) 

where ui is a variable of V such that ui>~x'^v'^w>~u. 

Step 2: Let us now express the quantifier V using ^, A and 3. Thus, the formula 
( [22| is equivalent in T to 

{3ui ui = f{u, v) Aui — /{w, u) A {3x u — x)) \/ (3u^{3w ^{u — f{v, w)))). 

Let us also express the logical symbol V using ^, A and 3. Thus, the preceding 
formula is equivalent in T to 

^(^(3ui ui — f{u, v) Aui — f{w, u) A (3a; u — x))A -^{3u -^{3w -^{u = f{v,w))))). 

(23) 
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Step 3: As the formula starts with ^, we move to Step 4. 

Step 4: The occurrences of the quantified variables u and w in (3u^{3w ^{u = 
f{v,w)))) must be renamed. Thus, the formula (23 1 is equivalent in T to 

^(^(3ui Ml = /(u, v)Aui = /(w, u)A{3x u = x))h^{3u2 ^{3wi ^{u2 = f{v, wi))))), 

where U2 and wi are variables of V such that wi >- U2 )^ ui >- x >- v )^ w >- u. 
Step 5: By lifting the existential quantifier 3x, the preceding formula is equivalent 
in T to 

^(^(3ui 3x ui — f{u, v) Aui — f{w, u) Au — x) A -^(3u2 ^{3wi ^{u2 — f{v, wi))))). 

Step 6: Let us group the two quantified variables x and ui into a vectorial quantifier. 
Thus, the preceding formula is equivalent in T to 

-^{-^{3uix ui = f{u, v) A ui — f{w,u) A u = x) A -^(3u2 ^(3wi ^(u2 = f{v, u>i ))))). 

Step 7: Let us introduce empty vectors of variables and formulas of the form true 
to get the normalized formula. According to the rule (19), the preceding formula is 
equivalent in T to 



3s true A 



-^{3uixui — f{u, v) Aui — f{w, u) Au = x)A 
^{3U2 ^{3wi -n{u2 f{v, Wl)))) 



which using the rule ( 20 1 with / = is equivalent in T to 
3s true A 



-^(3uixui — f {u, v) A ui ^ f (w , u) A u = x) A 
-^{3u2 -^{3wi -^{3su2 = f{v,wi)))) 



which using the rule (21 1 is equivalent in T to 
-1 3s true A 



(3uix ui — f{u, v) Aui = f{w, u) Au — x)A 
(3u2 true A -^(3wi true A -^(3su2 = f{v,wi)))) 



Step 8: This is a normalized formula of depth 4 which respects the discipline of the 
formulas in T since wi >- U2 >- ui >- x >- v >- w >- u. 



4-4 General solved formula 

Definition 4.4.1 

A general solved formula is a normalized formula of the form 

n 

-(3x«A /\-(3y,A)), 

i=l 

with n > and such that: 

1. a and all the Pi, with i G {1, are solved basic formulas. 

2. If a' is the conjunction of the equations of a then all the conjunctions a' A Pi, 
with « e {1, ...,n}, are solved basic formulas. 

3. All the variables of x are reachable in 3xa. 

4. For all i G {1, n}, all the variables of yi are reachable in 3yi Pi. 
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5. If finite{u) is a sub-formula of a then for all i G {l,...,n}, the formula (3i 
contains either finite{u), or finite{v) where i; is a reachable variable from u 
in a A (3i and does not occur in a left hand side of an equation of a A Pi. 

6. For all i G {!,..., n}, the formula (3i contains at least one atomic formula 
which does not occur in a. 



Example 4-4-^ 

Let w, V, Ml, U2, U3 be variables such that w v >~ ui U2 >~ u:^. The following 
formula is not a general solved formula 

-n{3e finite (w) A -n{3vw = v A finite{v))). (24) 

This formula satisfies all the conditions of Definition |4.4.1| but it does not satisfy 
the discipline of the formulas in T. In fact, the variable v is bound in {3vw — v A 
finitely)) and the variable w is free in (3v w = v A finitely)) and thus we should have 
y )~ w and not w > v. Let M4 be a variable such that uj^ > w )~ y > ui y U2 >- U3. 



The formula (24) is equivalent in T to 

-^{3e finite{w) A ^(3-^4 w = U4 A finite{v))). 

This formula respects the discipline of the formulas of T but is not a general solved 
formula since it does not satisfy the first condition of Definition |4.4.1[ In fact, 
w = U4 A finite{v) is not a solved basic formula since we have U4 >- w. 
The following formula is a general solved formula 

-^{3y 1*1 = f{y) Ay — U2 A finite{u2) A -^{3w U2 = f{w) A finite{w) A finite{u^))) . 
Property 4-4-3 

Let </? be a general solved formula. If ip has no free variables then ip is the formula 
~'i3£ true) else neither T \= -^Lp nor T \= ip. 

Proof 

Let be a general solved formula of the form 

^{3xaA f\^{3y,l3i)), (25) 

two cases arise: 

(1) If </? does not contain free variables, then according to the first and third 
condition of Definition |4.4.1| and using Property |4.2.9| we get x = e and a = true. 



As a consequence, the formula (25) is equivalent in T to 



i3etrueA /\^{3y,l3,)), (26) 



Since (26 1 has no free variables then each 3yi (3i has no free variables. According to 
the first and fourth condition of Definition |4.4.1| and using Property |4.2.9| we get: 
for all i ^ I: yi — e and Pi = true. But according to the last condition of Definition 



4.4.1 all the formulas Pi should be diS'erent from true (since we do not distinguish 
between a and a A true). Thus, the set I must be empty. As a consequence, ip is 
the formula -^{3e true). 
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(2) If (f contains free variables then it is enough to show that there exist two 
distinct instantiations ip' and (p" of ip by individuals of T?]^ such that 

Tr 1= If' and Tr \= ^<p". 

Note first that if / 7^ then each {3yi (3i), with i E I, should contain at least one 
free variable. In fact, if {3yi (3i), with i E I, does not contain free variables then this 
formula is of the form (3e true) according to the first and fourth point of Definition 
|4.4.1| and Property |4.2.9| which contradicts the last condition of Definition |4.4.1| 
(since we do not distinguish between a and a A true). Thus each (3yi(3i), with 
i E I, contains at least one free variable that can be instantiated. On the other 
hand: 

Case 1: If 3a; a contains free variables then we can easily find an instantiation of 
the free variables of 3x a which contradicts the constraints of a. In fact, let z be a 
free variable. Four cases arise: 



li z = w IS & sub-formula of a then according to Definition 4.4.1 a is a solved 
basic formula and thus z >- w. Ks a, consequence, w cannot be a quantified variable 
otherwise the formula (p does not respect the discipline of the formulas in T. Thus 
is enough to instantiate z and w by two distinct values. 

If z = f{w) is a sub-formula of a then it is enough to instantiate z by a tree which 
starts by a function symbol which is different from /. 



If w = z or w = t[z\ is a sub-formula of a then according to Definition 4.4.1 all the 



variables of x are reachable in 3x a and thus according to the first point of Remark 



4.2.8 the equations w = z and w = t[z\ are reachable. According to the second point 
of Remark 14.2.81 the value of z is linked to another free variable v which occurs in 
a left hand side of an equation of a. This case is already treated in two preceding 
cases. 

If finite{z) is a sub- formula of a then it is enough to instantiate z by an infinite 
tree. 

As a consequence, the instantiated formula of 3xa will be false in Tr and thus 
Tr 1= Lp' . On the other hand, by following the same preceding steps and since: 

(i) the set F contains an infinity of function symbols which are not constants, 

(ii) Tr contains an infinity of individuals u of Tr such that Tr \= finite ^{u), 

(iii) (y9 is a general solved formula, 

then we show that there exists at least one instantiation which satisfies all the 
constraints of a and contradicts the constraints of each Pi, with i E I. In fact, 
(iv) in order to contradicts each constraint Pi, it is enough to follow the preceding 
discussion (by replacing a hy Pi ) and use (i) and (ii). On the other hand, according 
to Definition |4.4. 1| all the variables of x are reachable in 3a; a, thus according to the 
first point of remark [4. 2. 8 1 all the equations and relations of a are reachable in 3a; a. 
According to the second point of remark [4.2.8| the values of the free variables which 
occur in these formulas are mainly linked to those of free variables which occur in 
left hand side of equations of a. According to the two first conditions of Definition 

Recall that Tr is the model of finite or infinite trees. 
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4.4.1 the variables of Lhs{a) are distinct and do not occur in FIN 1(a), Lhs{fii) 
and FINI{f3i) for all i £ {1, ...,n}. As a consequence, from (iv) and using (i), (ii) 
and (iii) there exists at least one instantiation which satisfies 3x a and contradicts 
each 3yi Pi in Tr, with i E I and thus Tr \= ^(p" . Note that if / = then we have 
also Tr ^ -^(p" and Tr \= ip' using the preceding instantiations. 

Case 2: If 3a; a does not contain free variables then according to the first and 



third condition of Definition 4.4.1 and Property 4.2.9 we have x — e and a — true. 
Since cp contains at least one free variable then I Let k E I. Since: 

(i) the set F contains an infinity of function symbols which are not constants, 

(ii) Tr contains an infinity of individuals u of Tr such that Tr \= finite ^(u), 

(iii) p is a, general solved formula, 

then we can easily find an instantiation of the free variables of 3yk j3k which satisfies 
the constraints of Pk (similar to the second part of Case 1 by replacing a by f3k)- 
Such an instantiation makes false the instantiated formula ^(3yk Pk) in Tr and 
thus Tr \= p' . On the other hand, according to (i), (ii) and (iii), we show that 
there exists at least one instantiation which contradicts the constraints of each 
with i E I (similar to the second part of Case 1 with a = true and x — s). As a 
consequence, this instantiation satisfies all the -^(3yi /?j) in Tr, with i E I and thus 
Tr h 

From Case 1 and Case 2, we have Tr \= p' and Tr \= ^p" , and thus neither 
T \= p nor T \= -^p. 

□ 

Example 4-4-4 

Let vi, V2, V, u and w be variables such that vi>-V2>-v>-u>-w. Let p be the 
following general solved formula 

^{3v u — g{v, w) A -^{3vi v — g{v,vi) Avi — /(«)) A ^(3^2 w — g{w, V2) Av2 ^ .f{w)) 

(27) 

Let us consider for example the model Tr of finite or infinite trees. If we instantiate 
the free variable u by the finite tree 1 where 1 is a constant in F which is distinct 
from g then according to axiom [1] of conflict of symbols, the instantiated formula 
of ( p7| is true in Tr. On the other hand, if u is instantiated by a tree of the form 
g{v*,'w*) with V* =/= g{v* , f{v*)) (for example v* = 1) and w* =/= g^w* , f{w*)) 



(for example w* = 1) then the instantiated formula of (27) is false in Tr. As a 



consequence (27 1 is neither true nor false in the theory T. The reader should not 
think that the fact that we have neither T |= -^p nor T \= p means that p is 
unsatisfiable in T. This is of course false. In fact, since neither T |= -^p nor T \^ p 
then p has in each model of T a set of solutions which make it true in A4 and 
another set of non-solutions which make it false in M . We also remind the reader 
that all the properties given after Section |4.1| hold only for formulas that respect 
the discipline of the formulas of T. 



A similar property has been shown for the finite trees of J. Lassez (Lassez and 



Marriott 1987) and the rational trees of M. Maher (Maher and Stuckey 1995). M. 



Maher in (Maher and Stuckey 19951 has also shown that if the set F is finite and 
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contains at least one n-ary function symbol with n > 2, then the problem of deciding 
if a formula containing equations and the logical symbols A, V, ^ is equivalent to 
a disjunction of conjunctions of equations is a co-NP-complete problem, and the 
problem of deciding if an expression represents a nonempty set of rational trees 
is NP-complete. Note also that in all our proofs we have not used the famous 



independence of inequations (Colmerauer 1984 Lassez et al. 1986 Comon 1988 



Lassez and McAloon 1986 1 but only the condition that the signature of T is infinite 



and contains an infinity of function symbols which are not constants and at least 
one symbol which is a constant, which implies in this case the independence of the 
inequations. 

Property 4-4-5 

Every general solved formula of the form -^(3x a A Ar=i ^i^Vi A)) equivalent in 
T to the following Boolean combination of existentially quantified basic formulas: 



{^{3xa))v\/{3xy^aAP,). 



i=l 



Proof 
Let 



.(3xaA /\-(3y,A)), 



(28) 



be a general solved formula. According to the third point of Definition |4.4.1| all the 
variables of x are reachable in 3a; a. Thus, according to Property |4.2.10[ we have 
T \= 31xa. According to Property 3.1.10 the formula (28) is equivalent in T to 

n 

-((3xa) A f\ -^{3xaM3y,l3i))), 



i=l 



i.e. to 



(-(3xa))V \/(3SaA(3y,A)), 



which, since the quantified variables have distinct names and different from those 
of the free variables, is equivalent in T to 



h{3xa)) V \/{3xyaA(3^), 



4=1 



which is a Boolean combination of existentially quantified basic formulas. □ 



Definition 4-4-6 

Let <p be a formula of the form 

n 

3xa/\ /\^{3y^(3,), (29) 
with X and y two vectors of variables, n> and a and the Pi, with i G {1, ...,n}. 
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basic formulas. We say that ip is written in an explicit solved form if and only if the 
formula -^tp, i.e. 

n 

-(3iaA/\-(3y,A)), (30) 

is a general solved formula. 

This definition shows how to easily extract from a general solved formula, a simple 
formula (p which has only one level of negation and where the solutions of the free 
variables are given in clear and explicit way, i.e. for each model A4 of T, it is easy 
to find all the possible instantiations of the free variables of ip which make it true 
in A^. In fact, according to Definition |4.4.1| we warrant among other things that 
the left hand sides of the equations of a are distinct and do not occur in those of 
each Pi , the left hand sides of the equations of each /3i are distinct and we cannot 
eliminate any quantification since all the variables are reachable. 

Example 4-4-'^ 

Let w, V, ui, U2, U3 be variables such that w v ui >- U2 u^. Let ip be the 
following general solved formula 

^(3v ui — f{v) Av — U2 /\ finite{u2) A ^(3w U2 — f{w) A finite{w) A finite{uj,))) . 

According to Definition |4.4.6| the following formula 4> is written in an explicit solved 
form: 

3v ui — f{v) Av — U2 A finite{u2) A -^{3w U2 = f{w) A finite{w) A finite{uz)) ■ (31) 

Let us chose the model Tr of finite or infinite trees and let us give all the possible in- 
stantiations 1*5 of the free variables ui, U2, "3 so that the instantiated formula 



of is true in the model Tr. From (31 1 it is clear that we have two possibilities: 

• Solution 1 : 

— Ug is any infinite tree. 

— U2 is any finite tree. 

— ul is the tree f{u2)- 

• Solution 2 : 

— U3 is any finite tree. 

— U2 is any finite tree which starts by a function symbol which is different 
from /. 

— u*i is the tree /(uj). 



4 ■ 5 Working formula 

Definition 4-5.1 

A working formula is a normalized formula in which all the occurrences of ^ are 
replaced by with k E {0, ...,5} and such that each occurrence of a sub-formula 
of the form 

p^^''{3xaAq), with fc > 0, (32) 
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satisfies the k first conditions of the condition list bellow. In (32 1 a is a basic 

formula, 5 is a conjunction of working formulas of the form ^'''{3yi (3i A g,), 
with n > 0, f3i a, basic formula, qi a conjunction of working formulas, and in the 
below condition list a' is the basic formula of the immediate top-working formulep^ 
p' of p if it exists. 

1. If p' exists then T |= a — > a' and T |= aeq a'eq where aeq and a'^^ are the 
conjunctions of the equations of a respectively a' . Moreover, the set of the 
variables of Lhs{a') U FINI{a') is included in those of Lhs{a) U FINI{a). 

2. The left hand sides of the equations of a are distinct and for all equations of 
the form u — v we have u)^ v. 

3. a is a basic solved formula. 

4. If p' exists then the set of the equations of a' is included in those of a. 

5. The variables of x, the equations of a and the constraints of the form finiteix) 
of a are reachable in 3xa. Moreover, if n > then for all i G {1, ...,n} the 
conjunction Pi contains at least one atomic formula which does not occur in 
a. 

The intuitions behind these working formulas come from an aim to have a full 
control on the execution of our rewriting rules by adding semantic informations on 
a syntactic form of formulas. We emphasize strongly that -^^ does not mean that 
the normalized formula satisfies only the fc*'* condition but all the conditions i with 
l<i<k. 

Example 4-5.2 

Let Wi, W2, W3, vi, u he variables such that wi >- W2 >- >~ vi >~ u. This is a 
working formula of depth 2: 

-''^{3'Wi u — f{wi) A Wi = Wi A finite{u))A 
-n^{3w2 u = f{vi) Aw2^ f{vi) Afinite{vi))A 
-^^{3w3 u = f{vi) A ui = /(ws) Afinite{w3)) 



3vi u = f{vi) A finite{u) A 



Definition 4-5.3 

An initial working formula is a working formula which begins with and such 
that A; = for all the other occurrences of A final working formula is a working 
formula of depth less or equal to 2 with k — 5 for all the occurrences of 

The relation between the final working formulas and the general solved formulas 
is expressed in the following property: 

Property 4-5-4 

Let p he the following final working formula -^^{3x aA/\"^^ -^^{3yi /3i)). The formula 
-i(3S a A Ar=i ^i'^Vi is a general solved formula equivalent to p in T where (if 
is the basic formula (3i from which we have removed all the equations which occur 
also in a. 



^® In other words, p' is of the f orm -1* (3x' a' A p* A p) where p* is a conjunction of working 
formulas and p is the formula (|32| . 
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Example 4- 5- 5 

Let W2, V, u and ui be variables such that W2 y v y u y u^. Let (p be the following 
final working formula 

3ev = u A finite{u)A 
-•^(Bs v = uAu = uiA finite(ui))A 
-i^(3w2 V = u A u = s{w2) A finite{w2)) 

The formula 

3ev = uA finite{u)A 
^(3e u ~ ui A finite{ui))A 
^{3w2 u = s{w2) A finite{w2)) 
is a general solved formula equivalent to tp in T. 



4-6 Rewriting rules 



We now present the rewriting rules which transform an initial working formula of 

any depth d into an equivalent conjunction of final working formulas. To apply the 
rule pi =^ P2 to the working formula p means to replace in p a sub-formula pi by 
the formula P2, by considering that the connector A is associative and commutative. 
In the following, the letters u, v and w represent variables, the letters x, y and z 
represent vectors of variables, the letters a, b and c represent basic formulas, the 
letter q represents a conjunction of working formulas, the letter r represents a 
conjunction of flat equations, formulas of the form finite{x) and working formulas. 
All these letters can be subscripted or have primes. 



(1) -■^(BS -u = u A r) 

(2) -n^{3xv = uAr) 

(3) -^^{3xu = V Au = t Ar) 

(4) -i^{3xu = fvi...Vn Au = gwi...Wm A r) 

(5) -i^{3xu = fvi...Vn Au = fwi-.-Wn A r) 

(6) -.^(3xoAg) 

(7) -'^{3xfinite{u) A finite{u) A r) 

(8) {3x u = V A finite (u) A r) 

(9) {3x fimte{u) A a A q) 

(10) -i^i3xu = fivi, Vn) Afinite{u) A r) 

(11) -.^(3x0 A g) 

(12) -.*(3xaAgA-.°(3yr)) 

(13) -.*(3aoAo' AgA-.^(3ya" Ar)) 

(14) ^^(3SaAgA^^(3j/a)) 

(15) -*(3a;aAAr=i-'(3yi60) 

3xa AqA 

(16) ^ 3ybA 



-.^(3xr) 
~'^{3xu = V Ar) 
-^^ (3x u = vAv = tAr) 
true 

-i^(3xw = fvi...v„ A /\"=i Vi = Wi A r) 
-i^(3aa A q) 

-<^{3xfinite{u) A r) 
-i^{3xu = V A finite{v) A r) 

true 

-?{3xu = f{v\,...,v„) A f\"^^finite{vi) A r) 
-?'(^xaAq) 
-i*(3xo A q A -i^(3y o A r)) 
-^'^{3xa A a' AqA -i''(3y o A r)) 
true 

^\3x' a' A ^.^^^\3y[h'^r) 
^*(3SaAgA-.^(3y6))A 



a: 



^{3xyziCi A go)* 
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with u > V, f and g two distinct function symbols taken from F. In rule (3), t 
is a flat term, i.e. either a variable or a term of the form /(xi, with / an 

n-aiy function symbol taken from F. In rule (6), the equations of a have distinct 
left hand sides and for each equation of the form u = v we have u "i^ v. In rule (9), 
the variable u is reachable from u in a. In rule (10), the variable u is non-reachable 
from u in a. Moreover, if / is a constant then n = 0. In rule (11), a is a solved basic 
formula. In rule (13), a and a" are conjunctions of equations having the same left 
hand sides and a' is a conjunction of formulas of the form finite{u). In rule (15), 
n > and for all i G {1, ...,n} the formula bi is different from the formula a. The 
pairs {x'.a') and (y^,&9 are obtained by a decomposition of x and a into x'x"x"' 
and a' A a" A a'" as follows: 

• o' is the conjunction of the equations and the formulas of the form finite{x) 
which are reachable in 3x0. 

• x' is the vector the variables of x which are reachable in 3x a. 

• a" is the conjunction of the formulas of the form finite{x) which are non- 
reachable in 3x a. 

• x" is the vector the variables of x which are non-reachable in 3x a and do not 
occur in the left hand sides of the equations of a. 

• a'" is the conjunction of the equations which are non-reachable in 3x a. 

• x'" is the vector the variables of x which are non-reachable in 3a; a and occur 
in the left hand sides of the equations of a. 

• b* is the formula obtained by removing from bi the formulas of the form 
finite{u) which occur also in a" 

• is the vector of the variables of j/ix'" which are reachable in 3yix"' b* . 

• 6^ is the conjunction of the equations and the formulas of the form finite{x) 
which are reachable in 3yix"' b*. 

• K C {1, ...,n} is the set of the indices i such that i G K \{ and only if no 
variable of x" occurs in b[. 

• The formula /\i^K ^^i^Vi^d* formula f\i(zK^^{^y'i^'i) which we 
have renamed the quantified variables so that they satisfy the discipline of 
the formulas in T. 

In rule (16), n > and go is the formula q in which all the occurrences of -^^ 
have been replaced by -1°. The formula Ar=i "'^i^^V^i ^ Qo)* is the formula 
-^^{3xyzi Ci A go) in which we have renamed the quantified variables so that 
they satisfy the discipline of the formulas of T. 

The use of indices on the negations of the working formulas enables us to force 
the application of the rules to follow a cleax strategy until reaching a conjunction of 
final working formulas. In fact, the algorithm follows two main steps while solving 
any first-order constraint in T: 

• (i) A top-down propagation of basic formulas following the tree structure 

of the working formulas and using the rules (1),...,(13). In this step, basic 
formulas are solved and copied in all sub-working formulas. Finiteness is also 
check and inconsistent basic formulas are removed by the rules (4) and (9) . 
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• (ii) A bottom-up elimination of quantifiers and depth reducing of the working 
formulas using the rules (14),.. .,(16). Inconsistent working formulas are also 
removed in this step. 

More precisely, starting from an initial working formula (fi of the form -i^(3Sa A 

AiG/ 9*)' "^ticrc all the arc working formulas whose negations are of the form 
rule (12) propagates the atomic formulas of a into a sub-formula qi, with i £ I, 
and changes the first negation of Qi into -i^. The rules (1),...,(5) can now be applied 
until the equations of a have distinct left hand sides and for each equation of the 
form u = V we have u )~ v. Rule (6) is then appli(xl and changes the first negation 
of qi into —i^. The algorithm starts now a new phase which consists in solving the 
basic formulas using the rules (7),. ..,(10). In particular finiteness is checked by rule 
(9). When a solved basic formula is obtained, rule (11) is applied and changes the 
negation into -i^. Note that if a working formula starts by then its top working 
formula starts by Rule (13) is then applied. It restores some equations and 
changes the first negation into Rule (12) can now be applied again since all the 
nested negations are of the form —f^ and so on. This is the first step of our algorithm. 
Once the sub- working formulas of depth 1 axe of the form -i'*(3yj6j), the second 
step starts using rule (15) with n = on all these sub-working-formulas of depth 1 
and transforms their negations into -i^. Inconsistent working formulas of the form 
-•^(35; a A -'^{3ya) A q) are then removed by rule (14). When all the inconsistent 
working formulas have been removed, rule (15) with n 7^ can be applied on 
the sub-working-formulas of depth 2 of the form -i^(3Sa A -1^(3^^ 6^)) and 
produces working formulas of the form -^^{3xa A ^^{3yibi)). Rule (16) can 
now be applied on the working formulas of depth d > 2 of the form -i'^(3x a A q A 
-■^(3^ hA/\^^^ ""^(3^1 Ci))). After each application of this rule, new working formulas 
containing negations of the form —f^ are created which implies the execution of the 
rules of the first step of our algorithm, starting by rule (12) and so on. After several 
applications of our rules, we get a conjunction of working formulas whose depth is 
less or equal to 2. The rules are then applied again until all the negations of these 
working formulas are of the form -1^. It is a conjunction of final working formulas. 

Example 4^.6.1 

Let / and g be two function symbols taken from F of respective arities 2, 1. Let 
Wi, W2, Wi, Ml, M2, 1*3 be variables such that Wi > W2 )^ Vi )^ Ui > U2 > u^- Let us 
run our rules on the following initial working formula 

3vivi = f{u\,U2) Au2 = g{ui)A 

-"(3.«it;i =.9(«;i))A . (33) 

^°(3w;2 U2 — g{w2) Aw2 — g{u3) A finite{w2)) 

According to rule (12), the preceding formula is equivalent in T to 

3wi vi = /(ui, U2) A M2 = g{ui)A 
-^^{3wi vi = g{wi) Avi = f{ui,U2) Au2= g{ui))A 
~f^{3w2U2 = g{w2) Aw2 = gius) A finite{w2)) 

The application of rule (4) on the sub formula -1^(3^1 vi = g{wi) Avi = /(mi, W2) A 
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U2 = g{ui) Afinite{w2)) simplifies this sub formula into the formula true. Thus, the 
preceding formula is equivalent in T to 

4 r 3vi vi = /(wi, U2) A it2 = giui)A 
[ -'°{3w2 U2 = g{w2) Aw2= g{u3) A finite{w2)) J ' 

which according to rule (12) is equivalent in T to 
r 3vivi = f{ui,U2) Au2 = g{ui)A 

[ -^°{3w2Vi = f{ui,U2) Au2 = g{u-i) Au2 = g{w2) Aw2 = g{u3) Afinite{w2)) 
Rule (5) can now be applied. Thus, the preceding formula is equivalent in T to 

_^4 r 3vi vi = /(ui, M2) Au2 = giui)A 

[ -^^{3w2 vi = f{ui,U2) Au2 = g{w2) Aw2 = ui AW2 = ^(ws) A finite {w 2)) J ' 

which according to rule (3) is equivalent in T to 

4 3vivi = f{ui,U2) Au2 = g{ui)A 
[ -•'^{3w2Vi = f{ui,U2) Au2 = g{w2) Aw2 = ui Aui= g{u3) Afinite{w2)) 

Since the conjunction of equations of the sub-formula which starts by has distinct 
left hand sides and W2 >~ ui, then rule (6) can be applied. Thus, the preceding 
formula is equivalent in T to 

r 3viVi = f{ui,U2) AU2 = 5f(ui)A 

[ -1^(3^2 t^i = f{ui,U2) Au2 = g{w2) Aw2 = ui Aui = g{u3) Afinite{w2)) J ' 

which according to rule (8) is equivalent in T to 

_^4 r 3vi vi = /(wi, 1*2) Au2 = g{ui)A 

[ -''^{3w2 vi = f{ui,U2) Au2 = g{w2) Aw2 = ui Aui = g{uz) Afinite{ui)) J ' 

which according to rule (10) is equivalent in T to 

_^4 r 3vi vi = f{ui,U2) AU2 = g{ui)A 

[ -^^{3w2Vi = f{ui,U2) Au2 ^ g{w2) Aw2 ^ ui Aui ^ g{u3) Afinite{u:i)) 

Since the basic formulas are solved then rule (11) can be appUed. Thus, the pre- 
ceding formula is equivalent in T to 

_^4l 3vivi = f{ui,U2) Au2 = g{ui)A 

[ -1^(3^2 1'l = f{ui,U2) Au2 = g{w2) Aw2 = ui Aui = g{u3) Afinite{us)) J ' 

which according to rule (13) is equivalent in T to 

_A \ 3vi Vi = f{ui,U2) AU2 = g{ui)A 

I -''^{3iU2Vi = f{ui,U2) Au2 = g{ui) Aw2 = uiAu\ = g{u3) Afinite{u3)) 

Rule (15) can now be applied with n = 0. Thus, the preceding formula is equivalent 
in T to 

_^4 r 3i>i vi = /(ui, W2) A M2 = g{ui)A 

[ -^^{3evi = f{ui,U2) AU2 = g{ui) Aui = g{u3) A finite{u3)) 

Once again rule (15) can be applied, with n ^ and we get the following final 
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working formula 

^5 r 3eu2^ 5(wi)A 

|_ -n"''{3eu2 = g{ui) A ui = 5(1*3) Afinite{u3,)) \ ' 

which according to Property |4. 5. 4| is equivalent in T to the following general solved 
formula 

^["1*2= .9("i)A 

|_ -^{ui = g{u3) A finite{u3)) 

We have seen in the preceding example how the rules (1),...,(15) can be applied. 
Let us now see how rule (16) is applied. 



Example 4-. 6. 2 

Let s and be two function symbols taken from F of respective arities 1, 0. Let wi, 
W2, u, V be variables such that wi >- W2 >- v u. Let us apply our rules on the 
following working formula of depth 3: 



3e true A 



^^{3eu = s(w))A 
-^^{3wi u ~ s{wi) A wi 



s(w))A 

^{3sv = M A ^^(3ew = u A u = 0) A 



'(3W2 V — u /\ u — s{w2))) 



By considering that 



• {3xa) 

• q = 



(3etrue) 
'{3eu^ s{v))A 
'{3wi u — s{wi) A Wi 
{3e V — u) 

-^^{3£V = 



u Au^ 0)A 
- u Au : 



-^^(3w2 V — u A u = s(w2)) 

rule (16) can be applied and produces the following formula 

-n^{3e true A -n^{3eu = s{v)) A -^^{3wi u = s{wi) Awi = s{v)) A ^^{3ev = u))A 
-n^{3ev = w A w = A ^°(u = s{v)) A ^"(Bwn u = s(wii) A Wn = s{v)))A 
-^'^{3w2 V — u A u = s{w2) A ^°(3e u — s{v)) A -'^{3wi2 u — s{wi2) A W12 = s{v))) 

where wu and W12 are variables such that wu >- W12 >- wi >- W2 y v >- u. Now, 
only the rules (1),...,(15) will be applied until all the negations are of the form 
Rule (16) will not be applied anymore since there exists no working formulas of 
depth greater or equal to 3 and the rules (1 ),..., (15) never increase the depth of the 
working formulas. 



Property 4.. 6. 3 

Every repeated application of the preceding rewriting rules on an initial working 
formula p is terminating and producing a wnfv conjunction of final working formulas 
equivalent to p in T. 
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Proof 



Proof first part: The application of the rewriting rules terminates. Let us introduce 
the function a : q n, where g is a conjunction of working formulas, n an integer 
and such that 



• a{true) — 0, 

• a{-^{3x a ^ ip)) = 2"('^), 



with a a basic formula, ip a conjunction of working formulas and the working 
formulas. Note that if a(p2) < ct{pi) then Q;(p[p2]) < oi{p) where p[p2] is the formula 
obtained from p when we replace the occurrence of the formula pi in phy p2- This 



function has been introduced in (Vorobyov 19961 and (Colmerauer and Dao 2003 1 



to show the non-elementary complexity of all algorithms solving propositions in the 
theory of finite or infinite trees. It has also the property to decrease if the depth of 
the working formula decreases after application of distributions as it is done in our 
rule (16). 

Let us introduce also the function A : (m, a) — > n, where w is a variable, a a basic 
formula, n an integer and such that 



A(w, a) 



0, 



A(w,a), 



if the conjunction of the equations of a has 
not distinct left hand sides or contains a 
sub-formula of the form x — y with y x, else 

if u does not occur in a left hand side of an equation 
of a, or u is reachable from u in a, else 

if the equation u = v is in a, else 



Sr=i '^i'^ij^)^ if the equation u = /(fi, is in a. 



Since the variables which occur in our formulas are ordered by the order relation 
" ;^ " , we can number them by positive integers such that 

X >- y ^ no{x) > no{y), 

where no{x) is the number associated to the variable x. Let us consider the 10- 
tuple (ni, 712, 713, 714, ?i5. Tig, 717, 718, fig, fT'io) where the Ui^s are the following positive 
integers: 

m = a{p), 

712 is the number of 

71-3 is the number of 

714 is the number of occurrences of function symbols in sub-formulas of the form 
(...). For example, if we have -^^(3x x = f{y) Ay = f{x) Ax = g{x, w) Ay = f{y)) 

then 714 = 4. 

715 is the sum of all the no{x) for each occurrence of a variable x in a basic formula 
of a sub- formula of the form -^^ {...). For example, if we have x = f{x, z) Ay = 
X A finite{z) A ...) then 715 — no{x) + no{x) + no{z) + no{y) + no{x) + no{z) H- .... 
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uq is the number of formulas of the form v = u with u )^ v in sub-formulas of the 

form -^^{...), 

TLj is the number of 

rig is the sum of all the A(w, a) for each occurrence of a sub- formula finite{u) in 
a basic-formula a of a working formula of the form —''^{3xa A q). For example, if 
we have —''^{3zx = f{x,z) A z = fiUyU) /\ finite (x) A finite (x) /\ finite{z)) then 
rig — X{x, a) + X{x, a) + X{z, a) = 1 + 1 + (2 + 1 + 1) where a is the basic formula 
X ~ f{x, z) f\ z = f{y, y) A finite{x) A finite{x) A finite{z). 
rig is the number of —'^ 
nio is the number of 

For each rule, there exists a positive integer i such that the application of this rule 
decreases or does not change the values of the n^'s, with 1 < j < i, and decreases 
the value of rr^. These i are equal to: 1 for the rules (4), (9), (14) and (16), 2 for 
rule (12), 3 for rule (6), 4 for rule (5), 5 for the rules (1), (3), (7) and (8) , 6 for 
rule (2), 7 for rule (11), 8 for rule (10), 9 for rule (13), and 10 for rule (15). To each 
sequence of formulas obtained by a finite application of the preceding rewriting 
rules, we can associate a series of 10-tuples (ni, n2, ns, n4, ns, ng, n7, ng, ng, riio) 
which is strictly decreasing in the lexicographic order. Since the rr^'s are positive 
integers, they cannot be negative, thus, this series of 10-tuples is a finite series and 
the application of the rewriting rules terminates. 

Proof, second part: Let us now show that for each rule of the form p =^ p' we have 
T \= p p' and the formula p' remains a conjunction of working formula. 



Correctness of the rules (1),...,(14) 



The rules (1),...(5) are correct according to the axioms [1] and [2] of T. Rules (6) 
and (11) are evident. The rules (7) and (8) are true in the empty theory and thus 
true in T. In rule (9), the variable u is reachable from itself in a, i.e. the basic 
formula a contains a sub-formula of the form 



ti AU2 = t2 A ... A Un = tr, 



(34) 



where Ui occurs in the term ti_i for all i € {2, n} and u occurs in i„. According to 
Definition 4.5.1 since our working formula starts with then all the equations of 
a have distinct lef hand sides and for all equations of the form x — y we have x >- y. 



Thus, there exists at least one equation in (34 1 which contains a function symbol 
which is not a constant, otherwise (34 1 is of the form u — U2AU2 = 1*3 A ... A u„ = u 
which implies u >- U2 >~ ... >- u, i.e. u >~ u which is false since the order >- is 
strict. Thus, according to the fourth axiom of T we have T \= a ^ -^finite{u). 
As a consequence, rule (9) is correct in T. Rule (10) is correct according to the 



last axiom of T. Rule (13) is correct according to Property 4.2.4 and Definition 



4.5.1 The rules (12) and (14) are true in the empty theory and thus true in T. 
Note that according to Property |4.2.5| two solved basic formulas having the same 
equations are equivalent if and only if they have the same relations finite{x). This 



is why in Definition 4.5.1 of the working formulas (more precisely in condition 4) 
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we force only the equations to be included in the sub-forworking formulas and use 
the elementary rule (14) to remove inconsistent working formulas of depth 2. 



Correctness of rule (15) 

1=1 ieK 

with n > 0, and for all i G {!,..., n} the formula bi is different from the formula 
a. The pairs (x',a') and (jj[,h^) are obtained by a decomposition of x and a into 
x'x"x"' and a' A a" A a'" as follows: 

a' is the conjunction of the equations and the formulas of the form finite(x) which 
are reachable in 3x a. 

x! is the vector the variables of x which are reachable in 3a; a. 

a" is the conjunction of the formulas of the form finite(x) which are non- reachable 

in 3a; a. 

x" is the vector the variables of x which are non-reachable in 3a; a and do not occur 
in the left hand sides of the equations of a. 

a'" is the conjunction of the equations which are non-reachable in 3a; a. 

x'" is the vector the variables of x which are non-reachable in 3a; a and occur in the 

left hand sides of the equations of a. 

b* is the formula obtained by removing from bi the formulas of the form finite{u) 
which occur also in a" 

y'i is the vector of the variables of yix'" which are reachable in 3yix"' b*. 

b[ is the conjunction of the equations and the formulas of the form finite{x) which 

are reachable in 3yix"'b*. 

K C {1, ...,rt} is the set of the indices i such that i E K li and only if no variable 
of x" occurs in b[ . 

The formula AieK '^^i'^v'i ^'i)* the formula Aiex '^^{'^v'i K) ™ which we have re- 
named the quantified variables so that they satisfy the discipline of the formulas in 
T. 

Let x' ,x" ,x'" ,y' and a' ,a" ,a'" ,b* ,bi be the vector of variables and the basic 



formulas defined above. According to Definition 4.2.6 (i) all the variables of x" 
and a;'" do not occur in a', otherwise they are reachable in 3a; a. On the other hand, 
since the first negation in the left hand side of rule (15) is of the form then 



according to Definition 4.5.1| (ii) a is a solved basic formula and thus a;'" is the 



vector of the left hand sides of the equations of a'" and its variables do not occur 
in a". Thus, according to (i) and (ii) the left hand side of rule (15) is equivalent in 
T to 

n 

-(3a;' a' A (3a;" a" A (3a;'" a'" A /\ ^{3y,b,)))). 

i=i 

Since a is a solved basic formula then a"' is a solved basic formula which contains 



only equations and thus according to Property 4.2.3 we have T \= 3\x"' a!" . Thus 
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according to Property 3.1.11 the preceding formula is equivalent in T to 

n 

^{3x'a' A (3a;" a" A /\ ^{3x"' a'" A (3y, b,)))), 



i=l 



which, according to the discipline of the formulas in T (the quantified variables 
have distinct names and different from those of the free variables ) , is equivalent in 
T to 



n{3x' a' A {3x" a" A /\ ^{3x'"y, a'" A 6,)))- 



(35) 



i=l 



Since all the nested negations in the left hand side of rule (15) are of the form 
then according to Definition 4.5.1 for all i G {1, n}, the set of the equations of a 
is included in those of bi. As a consequence, the formula (35) is equivalent in T to 

n 

^{3x' a' A {3x" a" A f\ -(3x"'y, h))), 



i.e. to 



n(3x' a' A {3x" a" A /\ ^{3x"'y, b*))). 



Since all the nested negations in the left hand side of rule (15) are of the form 
then according to Definition 4.5.1 for all i e {1, n}, b* is a solved basic formula. 
Thus, according to Property 4.2.11[ the preceding formula is equivalent in T to 

n 

^{3x'a' ^{3x"a" ^ f\^{3m))). 



which is equivalent in T to 

-(35' a' A ( /\ ^{3y', h[)) A (3a;" a" A 



e{i,...M-K 



where K C {1, is the set of the indices i such that i G if and only if no 

variable of x" occurs in b[. Since all the nested negations in the left hand side of rule 
(15) are of the form then according to Definition 4.5.1 for all i G {1, ...,n} — K, 



the variables of are reachable in 3y[ h[ and the formula b'^ is a solved basic 
formula. Moreover, since each h[ does not contain sub-formulas of the form finite{x) 
which occur also in a" (see the construction of b*), then the formula 3x" a" A 
AiG{i n}-K ^(3y' K) satisfies the conditions of Property ' 



4.2.13 



As a consequence. 



according to Property |4.2.13| the preceding formula is equivalent in T to 

.(3x'a'A /\ ^i3y[b',)), 



i.e. to 



-(3a;' a' A /\ -(3y^6^)*), 

where /\ -^^ -^(3y^ &■)* is the formula /\-^^ ^^{3y^b'^ in which we have renamed 
the quantified variables so that they satisfy the discipline of the formulas in T. 
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According to the conditions of application of rule (15) and the form of the negations 
in the left hand side of this rule, we check easily that we can fix the negations of 
the preceding formula as follows 

Thus, rule (15) is correct in T. 



Correctness of rule (16) 



3xa A qA 
3ybA 

KU-H3-z,c.) 



-^^{3xa Aq A^^{3yb))A 
A"=i ^^{^xyzic, A qo)* 



with n > 0, and go is the formula q in which all the occurrences of have been re- 
placed by -i". The formula A"=i ^"'(^xyzi q Ago)* is the formula A"=i ~^^(^^yzi c, A 
go) in which we have renamed the quantified variables so that they satisfy the dis- 
cipline of the formulas of T. 

The left hand side of rule (16) is equivalent in T to 

n 

^{3xaA q A -^{3yh A ^ \/ {3z^ a))). 

i=l 

Since the first negation of -^{3yb... in the left hand side of rule (16) is of the form 
then according to Definition 



4.5.1 



all the variables of y are reachable in 3y b, 
and thus according to Property 4.2.10 we have T \= 31 yb. According to Property 



3.1.10| the precedent formula is equivalent in T to 

n 

^{3xaAqA^{{3yb)A^{3ybA\J{3z,c,)))). 

i=l 

By distributing the A on the V and the 3 on the V and since the quantified variables 
have distinct names and different from those of the free variables then the preceding 
formula is equivalent in T to 

n 

-^{3xaA q A ^{{3yb) A ^ \/ {3z,y b A c^))) , 

i=l 

n 

^{3xaAqA{{^{3yb))v\/{3z,ybAc,))), 



i.e. to 



i=l 



i.e. to 



-(3x (a A g A -(3y 6)) V Y (a A g A {3z,y b A q))), 
1=1 

which, according to the discipline of the formulas in T (the quantified variables 
have distinct names and different from those of the free variables), is equivalent in 
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T to 



i.e. to 



i.e. to 



-^{3x {a A q A ^{3y b)) V \J {3z,y a A g A 5 A c;)), 

i=l 
n 

T((3xa A q A ^(3y 6)) V \J {3xziy a A q Ab A q)), 



i=l 



-•(Bxa A q A ^{3yb)) A -^{3xyzi a Aq Ab A q). 

i=l 

Since we have ^^{3y b... in the left hand side of rule (16) then according to Definition 
we have (i) T |= 6 ^ a. But since we have also ^^(3zj for alH e {1, n}, 



4.5.1 



then according to Definition 4.5.1 we have (ii) T \= Ci —i- b. From (i) and (ii) we 



have T \= Ci —^ {a Ab). Thus the preceding formula is equivalent in T to 

n 

^{3xaAqA^{3yb)) A /\^{3xyz,Ci A q), 

i=l 

i.e. to 

n 

^{3xaAqA^{3yb)) A /\^{3xyz, A q)* , 



i=l 

where Ar=i '^'^i^^V^i A q)* is the formula Ar=i ^^(^s^j/^i c-i A g) in which we have 
renamed the quantified variables so that they satisfy the discipline of the formulas 
of T. According to the conditions of application of rule (16) and the form of the 
negations in the left hand side of this rule, we check easily that we can fix the 
negations of the preceding formula as follows 

n 

^*{3xaAqA -^^3y b)) A /\ ^\3xyz, c, A go)* , 

1=1 

where go is the formula q in which all the occurrences of ^'^ have been replaced by 
Thus rule (16) is correct in T. 

Proof, third part: Every repeated application until termination of the rewriting rules 
on an initial working formula produces a conjunction of final working formulas. 
Recall that we write Aig/ fi^ ^^'^ ^^^^ conjunction each formula of the form ipi-^ A 
ipi^ A ... A Lpi^ A true. In particular, for / = 0, the conjunction Aie/ is reduced to 
true. Moreover, we do not distinguish two formulas which can be made equal using 
the following transformations of sub-formulas: 

(p A ip =^ ip, ip Alp =4> Tp A ip, {tp A ijj) A (j) (p A (t/j A 4>), 
if A true p, pM false =^ p. 

Let us show first that every substitution of a sub- working formula of a conjunction 
of working formulas by a conjunction of working formulas produces a conjunction 
of working formulas. Let Aie/ be a conjunction of working formulas. Let ipt^ with 

G J be an element of this conjunction of depth d^. Two cases arise: 
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1. We replace ipk by a conjunction of working formulas. Thus, let f\j^j^ (f'j be a con- 
junction of working formulas which is equivalent to ipk in T. The conjunction of 
working formulas Aig/ fi is equivalent in T to 

( A ^ ( A -^j) 

which is clearly a conjunction of working formulas. 

2. We replace a strict sub- working formula of by a conjunction of working formulas. 
Thus, let be a sub- working formula of ipk of depth < dk (thus </i is different 
from ipk)- Thus, ipk has a sub- working formuleP] of the form 

^{3xaAi/\^i) A(b), 

where L is a finite (possibly empty) set and all the ipi are working formulas. Let 
Aje J 'f'j ^ conjunction of working formulas which is equivalent to (j) in T. Thus 
the preceding sub-working formula of tpk is equivalent in T to 

-(3iaA(A^0A(A</'.)), 
leL jeJ 

which is clearly a sub-working formula and thus ipj^. is equivalent to a working 
formula and thus Aig/ equivalent to a conjunction of working formulas. 

From 1 and 2 we deduce that (i) every substitution of a sub-working formula of a 
conjunction of working formulas by a conjunction of working formulas produces a 
conjunction of working formulas. 

Since each rule transforms a working formula into a conjunction of working for- 
mulas, then according to the sub-section "proof: first part" and (i) we deduce that 
every repeated application of the rewriting rules on an initial working formula ter- 
minates and produces a conjunction of working formulas. Thus, since an initial 
working formula starts by and all its other negations are of the form -^^ then 
all long the application of our rules and by going down along the nested negations 
of any working formula ip obtained after any finite application of our rules, we can 
build many series of negations which represent the paths that we should follow from 
the top negation of ip to reach one of the sub-working formulas of tp of depth equal 
to one. Each of these series is of the one of the following forms: 

• a series of followed by a possibly series of 

• a series of followed by one followed by a possibly series of 

• a series of followed by one followed by a possibly series of 

• a series of followed by one followed by a possibly series of 

• a series of followed by one or two 

• one or two 

^"^ By considering that the set of the sub-formulas of any formula <p contains also the whole formula 
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While all the negations of these series are not of the form or their length is greater 
than 2 then one of the rules (1),...,(16) can still be applied. As a consequence, when 
no rule can be applied, we obtain a conjunctions of formulas of depth less or equal 
to 2 in which all the negations are of the form It is a conjunction of final working 



formulas. Since all the rules do not introduce new free variables then Property 4.6.3 
holds. □ 



4-7 The Solving Algorithm 

Let p be a formula. Solving p in T proceeds as follows: 

(1) Transform the formula -^p (the negation of p) into a wnfv normalized formula 
Pi equivalent to ^p in T. 

(2) Transform pi into the following initial working formula p2 

P2 = ^''(3e true A ^"(3e true Api)), 

where all the occurrences of ^ in pi are replaced by 

(3) Apply the preceding rewriting rules on p2 as many time as possible. According 
to Property |4.6.3| we obtain at the end a wnfv conjunction p-^ of final working 
formulas of the form 

n m 

i=i j=i 

According to Property |4.5.4[ the formula p^ is equivalent in T to the following wnfv 
conjunction p4 of general solved formulas 

n rii 

i=i j=i 

where [3*^ is the formula [3ij from which we have removed all the equations which 
occur also in a^. Since P4 is equivalent to -^p in T, then p is equivalent in T to 

n rii 

/\-(3y,,/3*,-)), 
i=i j=i 

which is equivalent to the following disjunction p^ 

n TLi 

\/(3x,«,A /\-(3y„-/3*,.))- 

i=i j=i 

This is the final answer of our solver to the initial constraint p. Note that the 
negations which were at the beginning of each general solved formula of p^ have 
been removed and the top conjunction of p4 has been replaced by a disjunction. As 
a consequence, the set of the solutions of the free variables of ps is nothing other 
than the union of the solutions of each formula of the form 3xi Ofj A/\"4i Pij)- 
According to Definition |4.4.6| each of these formulas is written in an explicit solved 
form which enables us to easily extract the solutions of its free variables. On the 
other hand, two cases arise: 
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If p4 does not contain free variables then according to Property |4.4.3 the for 



niula p4 is of the form A"=i ^(^^ true) and thus is of the form V"=i true. 
Two cases arise: if rt = then is the empty disjunction (i.e. the formula 
false). Else, if n ^ then since we do not distinguish between ip A ip and ip, 
P5 is the formula 3e true. 



• If P4 contains at least one free variable then according to Property 4.4.3 
neither T p4 nor T \= -^p4 and thus neither T |= -^p^ nor T \= p^. 

Since T has at least one model and since P5 is equivalent to p in T and does not 
contain news free variables then we have the following theorem: 

Theorem 4. 7.1 

Every formula is equivalent in T either to true, or to false, or to a wnfv formula 
which has at least one free variable, which is equivalent neither to true nor to false, 
and where the solutions of the free variables are expressed in a clear and explicit 
way. 

The fact that T accepts at least one model is vital in this theorem. In fact, if T does 
not have models then the formula true can be equivalent to false in T. In other 
words, a formula can be equivalent to true in T using a finite application of our 
rules and equivalent to false using another different finite application of our rules. 
Theorem |3 . 3 . 1 1 prevents these kinds of conflicts and shows that T has at least three 
models V, Tr and TZa and thus T |= -^{true <-> false). 

Corollary 4^.7.2 

T is a complete theory. 

Note that using Theorem |4.7.1| and the properties |4.4.5| and |4.2.11| we get Maher's 



decision procedure ( Maher 1988) for the basic theory of finite or infinite trees. 



5 Implementation of our algorithm 



We have implemented our algorithm in CH — h and CHR (Constraint Handling 



Rules) (Fruehwirth 1998 Fruehwirth and Abdennadher 2003| Schrijvers and Frue 



hwirth 20061. The C++ implementation is a straightforward extension of those 
given in (Djelloul and Dao 2006b). It uses records and pointers and releases un- 



used pointers after each rule application. The CHR implementation was done us- 
ing Christian Holzbaur's CHR library of Sicstus Prolog 3.11.0. It consists of 18 
CHR constraints and 73 CHR rules - most of them are needed for the compli- 
cated rules (15) and (16) of our algorithm. Even if our C++ implementation has 
given better performances, we think that it is interesting to show how can we 
translate our rules into CHR rules. We will be able to quickly prototype optimiza- 
tions and variations of our algorithm and to parallelize it. For CHR, the imple- 
mentation of this complex solver helps to understand what programming patterns 
and language features can be useful. The CHR code without comments and ex- 
amples, but pretty-printed, is about 250 lines, which is one seventh of the size 
of our C++ implementation. Indeed for code size and degree of abstraction it 
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seems only possible and interesting to describe the CHR implementation, and 
we do so in the following. The reader can find our full CHR implementation at 



http://khalil.djelloul.free.fr/solver.txt and can experiment with it on- 



line using webchr at http : / / chr . inf ormatik . uni-ulm . de/'^webchr/[ 



5.1 Constraint Handling Rules (CHR) Implementation 

CHR manipulates conjunctions of constraints that reside in a constraint store. Let 
H, C and B denote conjunctions of constraints. A simplification rule H <^ C \ B 
replaces instances of the CHR constraints H by B provided the guard test C 
holds. A propagation rule H ^ C \ B instead just adds B to H without removing 
anything. The hybrid simpagation rules will come handy in the implementation; 
Hi\H2 'i^ C \ B removes matched constraints H2 but keeps constraints Hi. 

The constraints of the store comprise the state of an execution. Starting from 
an arbitrary initial store (called query), CHR rules are applied exhaustively until 
a fixpoint is reached. Trivial non-termination of a propagation rule application is 
avoided by applying it at most once to the same constraints. 

Almost all CHR implementations execute queries from left to right and apply 
rules top-down in the textual order of the program (Duck et al. 2004 1. A CHR 
constraint in a query can be understood as a procedure that goes efficiently through 
the rules of the program. When it matches a head constraint of a rule, it will look 
for the other constraints of the head in the constraint store and check the guard. 
On success, it will apply the rule. The rule application cannot be undone. If the 
initial constraint has not been removed after trying all rules, it will be put into 
the constraint store. Constraints from the store will be reconsidered if newly added 
constraints constrain its variables. 



5.1.1 CHR Constraints 

The implementation consists of 18 constraints: two main constraints that encode the 
tree data structure of the working formulas (nf/4) and the atomic formulas (of/2), 
9 auxiliary constraints that perform reachability analysis, variable renaming and 
copying of formulas, and 7 constraints that encode execution control information, 
mainly for rules (15) and (16). 

In more detail, nf (Parentid, Id,K,ExVars) describes a negated quantified basic 
formula with the identifier of its parent node, its own identifier Id, the level K from 
-i*' and the list of existentially quantified variables. Var=FlatTerm of Id denotes 
an equation between a variable and a fiat term (a variable or a function symbol 
applied to variables) that belongs to the negated sub-formula with the identifier Id. 
finite(U) of Id denotes the relation /i7izfe(C/). 

It is easy to represent any working formula ip using conjunctions of nf/4 and 
of/2 constraints. It is enough to create one nf/4 constraint for each quantified basic 
formula of (p and to use a conjunction of of/2 constraints to enumerate the atomic 
formulas linked to each quantified basic formula. 
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Example 5.1.2 

Let tp be the following working formula 

3uu= lA 

-^"{3eu = s{v))A 
-i°(3wi u = s{wi) Awi = s{v))A 

-,^(3£V = s{u) Au= 1 A finite{wi))A 
-■^(3w3 V = s{u) Au= 1 Aw2 = s{ws) A finite{ws)) 



-^^{36 V = s{u) A u = 1 A 



(p can be expressed using the following conjunction of constraints: 

iif(Q,Pl,4,[U]),U= lof PI, 

nf (PI, P2, 0, []), U = S(V) of P2, 

nf(Pl,P3,0, [W1]),U= S(Wl)of P3,W1 = S(V) of P3, 

nf (PI, P4, 5, []), V = S(U) of P4, U = 1 of P4 

iif(P4,P5,5, []), V = S(U) of P5,U= lof P5,finite(Wl)of P5 

nf (P4, P6, 5, [W3]), V = S(U) of P6, U = 1 of P6, W2 = S(W3) of P6, f inite(W3) of P6 



5.1.3 CHR Rules 

The rules (1) to (14) have a rather direct translation into CHR rules. It seems hard 
to come up with a more concise implementation. 

1 Locally simplify equations 

(1) @ nf(Q,P,l,Xs) \ U=U of P <=> true. 

(2) @ nf(Q,P,l,Xs) \ V=U of P <=> gt(U,V) I U=V of P. 

(3) @ nf (Q,P,l.Xs) , U=V of P \ U=G of P <=> gtCU.V) I V=G of P. 

(4) @ nf (Q,P,l,Xs) , U=F of P, U=G of P <=> notsamefunctor (F,G) I true(P). 

(5) a nf (Q,P,l,Xs), U=F of P \ U=G of P <=> samef unctor (F , G) I 

same_args(F,G,P) . 

(6) a nf(Q,P,l,Xs) <=> nf (Q,P,2,Xs) . 



2 finiteness check 

(7) @ nf (P0,P,2,Xs) , finite (U) of P \ finite (U) of P <=> true. 

(8) a nf (P0,P,2,Xs) , U=V of P \ finite(U) of P <=> var(V) I finite(V) of P. 
(9+10)anf (P0,P,2,Xs),U=T of P \ finite(U) of P <=> nonvar(T) I 

reach. args (U , T , P) , f inite.args (U , T , P) . 

(11) a nf(q,P,2,Xs) <=> nf (Q,P,3,Xs) . 



% 4/0-4/1 copy down before solving 

(12) a nf (Q,P,4,Xs) , A of P, nf (P,Pl,0,Ys) ==> A of PI. 

nf(Q,P,4,Xs) \ nf (P,Pl,0,Ys) <=> nf (P,P1 , 1 , Ys) . 



°/. 4/3-4/4 replace down after solving 

(13) a nf (q,P,4,Xs) ,U=V of P, nf (P,Pl,3,Ys)\ U=G of PI <=> V\==G I U=V of PI. 
nf(Q,P,4,Xs) \ nf(P,Pl,3,Ys) <=> nf (P,P1 ,4,Ys) . 



y, 4/5-true trivial satisfaction - each A of PI also occurs as A of P 
(14) a nf (q,P,4,Xs) , nf (P,Pl,5,Ys) <=> 

\+(f indconstraint(Pl, (A of Pl),_), \+f indconstraint(P, (A of P),_)) I 

true (P) . 

Note that rules (1) to (5) are similar to the classical CHR equation solver for flat 
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rational trees (Fruehwirth and Abdennadher 2003 Meister and Fruehwirth 20061. 
By applying results of (Meister and Fruehwirth 2006), we can show that the worst- 
case time complexity of these rules of the algorithm is quadratic in the size of the 
equations. 

In the rules (2) and (3), the predicate gt(U,V) checks if U ;^ V. Note that the 
constraint true(P) used in rule (4) removes all constraints associated with P using 
an auxiliary rule not shown. 

In rule (9+10) reach_args(U,T,P) checks reachability of U from itself in P. If 
so, true(P) will be executed and thus P will be removed, implementing rule (9). 
Otherwise, the subsequent f inite_args(U,T,P) will propagate down the finite 
relation from U to its arguments, implementing rule (10). 

In the rules (12) and (13) we handle equations one by one (due to the chosen 
granularity of the constraints), and thus we need auxiliary second CHR rules that 
perform the update of the level K afterwards. 



For rule (14) the implementation is easy when nested negation-as-absence (Van 



Weert et al. 20061 is used to verify that there is no constraint in the sub- 



formula that is not in the main formula. Negation-as-absence can be directly 
encoded in CHR, but then it requires two additional rules per negation. In- 
stead, we have chosen to use in the guard of the rule the CHR library built- 
in findconstraint (Var, Pattern, Match) that returns on backtracking all con- 
straints Match that match Pattern and that are indexed on variable Var together 
with negation-as-failure provided by the Prolog built-in \+. 

The translation of the complex rules (15) and (16) of the algorithm require 40 
CHR rules, because several non-trivial new expressions have to be computed. Simp- 
agation rules and auxiliary constraints collect the nested nf/4 constraints, compute 
the reachable variables and atomic formulas, rename the quantified variables and 
produce updated nf/4 and of/2 constraints. In order not to overburden the reader 
with technical details, we omit the description of those 40 rules. 



5.2 Benchmarks: Two partner game 

Let us consider the following two partner game: An ordered pair (z,j) is given, 
with i a non-negative (possibly null) integer and j € {0, 1}. One after another, each 
player changes the values of i and j according to the following rules 

• If J = then the actual player should replace i by i — 1 in the pair («, j). 

• If J = 1 and i is odd then the actual player can either replace i by i + 1 or 
replace j by j — 1, in the pair 

• If J = 1 and i is even then the actual player can either replace i by i + 1 and 
j by J — 1 in the pair or replace only i by i + 1 in the pair let 

The first player who cannot keep i non negative has lost. This game can be repre- 
sented by the following directed infinite graph: 
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(0.0) 



(1,0) (2,0) ■ 



(3,0) 



(4,0) 



.(5,0) 



It is clear that the player which is at the position (0, 0) and should play has lost. 
Suppose that it is the turn of player A to play. A position (n, m) is called k-winning 
if, no matter the way the other player B plays, it is always possible for A to win, 
after having made at most k moves. It is easy to show that 



winningk{x) 



3y move{x, y) A ^( 
3x move{y, x) A ^( 

3y move{x, y) A ^( 
3x move{y, x) A ^( 
false )...) 



'2k 

where move(x, y) means: "starting from the position x we play one time and reach 
the position y" . By moving down the negations, we get an embedding of 2k alter- 
nated quantifiers. 

Suppose that F contains the function symbols 0, 1, /, 5, c of respective arities 
0, 0, 1, 1, 2. We code the vertices of the game graph by the trees c{i,0) and 
c{i, 1) with i = ifgY^^iO) if i is even, and i = g{i — 1) if i is oddj^The relation 
move{x,y) is then defined as follows: 

dci 



with 



transition{x, y) 



move{x, y) <-> transition{x, y) V {^{3uv x — c{u, v)) A x — y) 



3uiViU2V2 

X = c{ui,vi) Ay = c{u2,V2)A 

(ui = A U2 = Wi A pred{ui,U2)) 
V 

{U2 = f{ui) AV2 = Wl)V 
{U2 = Ul A 1^2 = 0) 

{^{3w ui = g{w)) Au2 = g{ui) A {v2 = V W2 = 0)) 



dof 



{vi = lA 



(3w ui = g{w) A 



)V 



{^{vi = 0) A -^{vi = 1) A U2 = Ui A V2 = vi) 



pred{ui,U2) 



dcf 



(3jui = /(j)A 
{3j Ml = g{j) A 



{3kj = g{k)Au2=j)V 
{^{3kj = g{k)) AU2 = ui) 
{3k j = g{k) A U2 = ui)\J 
{^{3kj ^ g{k)) AU2= j) _ 
(-(3.? Ml = f{j)) A -^{3jui = g{j)) A -(wi = 0) A U2 = ui) J 



(/s)O(x) = X and = 
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If we take as input of our solver the formula winningk{x) then we will get as 
output a disjunction of simple formulas where the solutions of the free variable x 
represent all the /c- winning positions. 

For winningi{x) our algorithm gives the following formula: 

3uiU2 X — c(ui, U2) /\ui — g{u2) A U2 = 0, 

which corresponds to the solution x — c(g(0),0). For winning2{x) our algorithm 
gives the following disjunction of simple formulas 

{3uiU2 X = c(mi, U2) A Ui = g{u2) A M2 = 0) 

V 

(3U3U4M5U6 X = c(u3, ue) A M3 = g{u4) A U4 = /("s) A U5 = g{ue) Auq^O) 

which corresponds to the solution x — c{g{0), 0) V a; = c{g{f{g{0))), 0). Note that x 
is the only free variable in the two preceding disjunctions and its solutions represent 
the positions which are fc- winning. 

The times of execution (CPU time in milliseconds) of the formulas winningk{x) 
are given in the following table as well as a comparison with those obtained us- 



ing a decision procedure for decomposable theories (Djelloul 2006a I (even though 



the later does not produce comprehensible results, i.e. explicit solved forms). The 
benchmarks are performed on a 2.5Ghz Pentium IV processor, with 1024Mb of 
RAM. The symbol "-" bellow means exhausting memory. 



k {winning i;{x)) 



1 



7 



10 



20 



40 



I CHR (our 16 rules) I 320 I 690 I 1750 I 2745 I 5390 I - 



C++ (Djelloul 2006a) 28 50 115 150 245 430 2115 



I C++ (our 16 rules) I 25 I 40 I 90 I 115 I 175 I 315 1 1490 1 15910 1 



This decision procedure takes from 10% to 40% more time, comparing with our 
CH — h implementation to solve the winning^ {x) formulas of our game and overflows 
the memory for k > 20, i.e. 40 nested alternated quantifiers. Our C++ implemen- 
tation has better performance and is able to give all the winning^, strategies in a 
clear and explicit way until fc = 40, i.e. 80 nested alternated quantifiers. 

The execution times of winningk{x) using our CHR implementation are 12-30 
times slower than those obtained using our C++ implementation and the maximal 
depth of working formula that can be solved is 14 {k — 7). These results are in line 
with the experience that the overhead of using declarative CHR without optimisa- 
tions induces an overhead of about an order of magnitude over implementations in 
procedural languages. As discussed in the conclusions, switching to a more recent 
optimizing CHR compiler may close the gap to a small constant factor. 

The algorithm given in (Djelloul 2006a I is a decision procedure in the form of 
five rewriting rules which for every decomposable theory T transforms a first-order 
formula ip into a conjunction (j) of final formulas easily transformable into a Boolean 
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combination of existentially quantified conjunctions of atomic formulas. This deci- 
sion procedure does not warrant that the solutions of the free variables are expressed 
in a clear and explicit way and can even produce formulas having free variables but 
being always true or false in T. In fact, for our two player game, we got conjunctions 
of final formulas where the solutions of the free variable x was incomprehensible, 
especially from k = 5. 



We also tried to use Remark 4.4.2 of ( jPjelloul 2006a ) which gives a way to get a 
disjunction of the form 

\/(3x',a;A /\ -(3^:^./?:,.)) (36) 
iei je.Ji 



as output of the decision procedure. As the author of (Djelloul 2006a) wrote: "it 
is more easy to understand the solutions of the free variables of this disjunction of 
solved formulas than those of a conjunction of solved formulas". That is of course 
true, but this does not mean that the solutions of the free variables of this formula 
are expressed in a clear and explicit way. In fact, we got a disjunction of the form 



( 36 1 where many variables which occurred in left hand sides of equations of a[ 
occurred also in left hand sides of equations of some f3lj. Moreover, many formulas 
of the preceding disjunction contained occurrences of the free variable x but after 
a hard and complex manual checking we found them equivalent to false. As a 
consequence, the solutions of x was completely not evident to understand and we 
could not extract clear and understandable winningk{x) strategies for all fc > 5. 



In order to simplify the formula ( 36 1 we finally used our solving algorithm on it 



and have got a disjunction of simple formulas equivalent to (36) in T in which: 
(1) all the formulas having free occurrences of x but being always false in T have 
been removed, (2) the solutions of the free variable x were expressed in a clear and 
explicit way. 

We now discuss why our solver is faster than the decision procedure of K. Djelloul. 
The latter uses many times a particular distribution (rule (5) in ( Djelloul 2006a| ) 



which decreases the depth of the working formulas but increases exponentially the 
number of conjunctions of the working formulas until overflowing the memory. Our 
solving algorithm uses a similar distribution (rule (16)) but only after a necessary 
propagation step which copies the basic formulas into the sub- working formulas and 
checks if there exists no working formulas which contradict their top-working for- 
mula. This step enables us to remove the inconsistent working formulas and to not 
lose time with solving a huge working formulas (i.e. of big depth) which contradicts 
their top- working formulas. It also prevents us from making exponential distribu- 
tions between huge inconsistent working formulas which finally are all equivalent to 
false. Unfortunately, we cannot add this propagation step to the decision procedure 
of CDjelloul 2006a I since it uses many properties which hold only for the theory of 



finite or infinite trees and not for any decomposable theory T. 



The game introduced in this paper was inspired from those given in (Djelloul 



2006a I but is different. Solving a winningj^{x) formula in this game generates many 
huge working formulas which contradict their top-working formulas. Our algorithm 
removes directly these huge working formulas after the first propagation step (rules 
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(1),...,(13)). The decision procedure cannot detect this inconsistency and is obHged 
to apply a costly rule (rule (5) in ( Djelloul 2006a[ )) to decrease the size of these 
inconsistent working formulas until finding basic inconsistent formulas of the form 
-•(a A -'{3e true)) or -^{3e false A (p). At each application of this rule, the depth of 
the working formulas decreases but the number of conjunctions increase exponen- 
tially until overfiowing the memory. This explains why for this game the decision 
procedure overflows the memory for A; > 20 while our solver can compute the 
winningk{x) strategies until k = 40. 



5.3 Benchmarks: Random normalized formulas 

We have also tested our 16 rules on randomly generated normalized formulas such 
that in each sub-normalized formula of the form ^{3xa A Ar=i fi)' with the ipi's 
normalized formulas and n > 0, we have: 

• n is a positive integer randomly chosen between and 4. 

• The number of the atomic formulas in the basic formula a is randomly chosen 
between 1 and 8. Moreover, the atomic formula true occurs at most once in 
a. 

• The vector of variables and the atomic formulas of 3a; a are randomly gen- 
erated starting from a set containing 10 variables, the relation finite and 6 
function symbols: fo, fi, f2, go, gi, 92- Each function symbol fj or gj is of arity 
j with > j > 2. 

The benchmarks were realized on a 2.5Ghz Pentium IV processor with 
1024Mb of RAM as follows: For each integer 1 > d > 42 we gen- 
erated 10 random normalized formulaj^ of depth d, we solved them and 
computed the average execution time (CPU time in milliseconds). Once 
again, the performances (time and space) of our 16 rules are impressive 
comparing with those of the decision procedure for decomposable theories. 



12 



22 



26 



41 



CHR (our 16 rules) I 1526 I 4212 I 16104 I 



C++ (Djelloul 2006a I 108 375 1486 18973 



C++ (our 16 rules) 



202 504 3552 11664 2142824 



Note that for d = 42, all the normalized formulas could not be solved and over- 
flowed the memory. 



We of course renamed the quantified variables of each randomly generated normalized formula 
so that it respects the discipline of the formulas in T 
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6 Discussion and conclusion 

We gave in this paper a first-order axiomatization of an extended theory T of finite 
or infinite trees, built on a signature containing not only an infinite set of function 
symbols but also a relation finite{t) which enables to distinguish between finite or 
infinite trees. We showed that T has at least one model and proved its completeness 
by giving not only a decision procedure but a full first-order constraint solver which 
transforms any first-order constraint ip into an equivalent disjunction of simple 
formulas such that is either the formula true, or the formula false, or a formula 
having at least one free variable, being equivalent neither to true nor to false and 
where the solutions of the free variables are expressed in a clear and explicit way. 
This algorithm detects easily formulas that have free variables but are always true or 
always false in T and is able to solve any first-order constraint satisfaction problem 
in T. Its correctness implies the completeness of T. 



On the other hand S. Vorobyov (Vorobyov 19961 has shown that the problem 



of deciding if a proposition is true or not in the theory of finite or infinite trees 
is non-elementary, i.e. the complexity of all algorithms solving propositions is not 
bounded by a tower of powers of 2's (top down evaluation) with a fixed height. 



A. Colmerauer and T. Dao (Colmerauer and Dao 20031 have also given a proof of 



non-elementary complexity of solving constraints in this theory. As a consequence, 
our algorithm does not escape this huge complexity and the function a(ip) used to 
show the termination of our rules illustrates this result. 

We implemented our algorithm in C-I--I- and CHR and compared both perfor- 
mances with those obtained using a recent decision procedure for decomposable 



theories (Djelloul 2006a I. This decision procedure is not able to present the solu- 
tions of the free variables in a clear and explicit way and overflows the memory 
while solving normalized formulas with depth d > 40. Our C++ implementation 
is faster than this decision procedure and can solve normalized formulas of depth 
d = 80. This is mainly due to the fact that our algorithm uses two steps: (1) a 
top-down propagation of constraints and (2) a bottom-up elimination of quantifiers 
and depth reduction of the working formulas. In particular, the first step enables to 
minimize the number of application of costly distributions and avoids to lose time 
with solving huge formulas which contradict their top-formulas. 

Future implementation work will focus on our CHR implementation, since from 
previous experience we are confident that we can get the performance overhead 
down to a small constant factor while gaining the possibility to prototype variations 
of our algorithm in a very high level language. Switching to a more recent optimizing 
CHR compiler from K.U. Leuven would most likely improve performance. We also 



think that we can minimize the use of the debated negation-as-absence ( Van Weert 



et al. 2006 1 by introducing reference counters for the two main constraints. This 
should also give us the possibility to obtain a parallel implementation that is derived 
from the existing one with little modification, similar to what has been done for 



parallelizing the union- find algorithm in CHR (Fruehwirth 2005). 
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